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NOVEL METHODS OF DIAGNOSIS OF METASTATIC COLORECTAL 
CANCER, COMPOSITIONS AND METHODS OF SCREENING FOR 
MODULATORS OF METASTATIC COLORECTAL CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 
The present application is related to USSN 60/272,206, filed February 27, 
2001, USSN 60/281,149, filed April 2, 2001, and USSN 60/284,555, filed April 17, 2001, all 
of which are herein incorporated by referenced in their entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein 
expression profiles and nucleic acids, products, and antibodies thereto that are involved in " 
metastatic colorectal cancer, and to the use of such expression profiles and compositions in 
diagnosis and therapy of metastatic colorectal cancer. The invention further relates to 
methods for identifying and using agents and/or targets that inhibit metastatic colorectal 
cancer. 

BACKGROUND OF THE INVENTION 
Cancer of the colon and/or rectum (referred to as "colorectal cancer") are 
significant in Western populations and particularly in the United States. Cancers of the colon 
and rectum occur in both men and women most commonly after the age of 50. These 
develop as the result of a pathologic transformation of normal colon epithelium to an invasive 
cancer. There have been a number of recently characterized genetic alterations that have - 
been implicated in colorectal cancer, including mutations in two classes of genes, tumor- 
suppressor genes and proto-oncogenes, with recent work suggesting that mutations in DNA 
repair genes may also be involved in tumorigenesis. For example, inactivating mutations of 
both alleles of the adenomatous polyposis coli (APQ gene, a tumor suppressor gene, appears 
to be one of the earliest events in colorectal cancer, and may even be the initiating event 
Other genes implicated in colorectal cancer include the MCC gene, the p53 gene, the DCC 
(deleted in colorectal carcinoma) gene and other chromosome 18q genes, and genes in die 
TGF-P signaling pathway. For a review, see Molecular Biology of Colorectal Cancer, pp. 
238-299, in Curr. Probl Cancer, Sept/Oct 1997; see also Willams, Colorectal Cancer 
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(1996); Kinsella & Schofield, Colorectal Cancer: A Scientific Perspective (1993); Colorectal 
Cancer: Molecular Mechanisms, Premalignant State and its Prevention (Schmiegel & 
Scholmerich eds., 2000); Colorectal Cancer: New Aspects of Molecular Biology and Their 
Clinical Applications (Hanski et al. 9 eds 2000); McArdle et aL, Colorectal Cancer (2000); 
Wanebo, Colorectal Cancer (1993); Levin, The American Cancer Society: Colorectal Cancer 
(1999); Treatment of Hepatic Metastases of Colorectal Cancer (Nordlinger & Jaeck eds., 
1993); Management of Colorectal Cancer (Dunitz et al. 9 eds. 1998); Cancer: Principles and 
Practice of Oncology (Devita et al. 9 eds. 2001); Surgical Oncology: Contemporary Principles 
and Practice (Kiiby et al. 9 eds. 2001); Offit, Clinical Cancer Genetics: Risk Counseling and 
Management (1997); Radioimmunotherapy of Cancer (Abrams & Fritzberg eds. 2000); 
Fleming, AJCC Cancer Staging Handbook (1998); Textbook of Radiation Oncology (Leibel 
& Phillips eds. 2000); and Clinical Oncology (Abeloff et aL, eds. 2000). 

Imaging of colorectal cancer for diagnosis has been problematic and limited. 
In addition, metastasis of the tumor to the lumen, and metastasis of tumor cells to regional 
lymph nodes are important prognostic factors (see, eg., PET in Oncology: Basics and 
Clinical Application (Ruhlmann et aL eds. 1999). For example, five year survival rates drop 
from 80 percent in patients with no lymph node metastases to 45 to 50 percent in those 
patients who do have lymph node metastases. A recent report showed that micrometastases 
can be detected from lymph nodes using reverse transcriptase-PCR methods based on the 
presence of mKNA for carcinoembryonic antigen, which has previously been shown to be 
present in the vast majority of colorectal cancers but not in normal tissues. Liefers et a/., New 
England J. of Med 339(4):223 (1998). In addition, colorectal cancers often metastasize to 
the liver. However, the lack of information about the gene expression exhibited by these 
cancers limits die ability to effectively diagnose and treat the disease. 

Thus, methods for diagnosis and prognosis of metastatic colorectal cancer and 
effective treatment of colorectal cancer would be desirable. Accordingly, provided herein are 
methods that can be used in diagnosis and prognosis of metastatic colorectal cancer. Further 
provided are methods that can be used to screen candidate therapeutic agents for the ability to 
modulate, e.g., treat, colorectal cancer. Additionally, provided herein are molecular targets 
and compositions for therapeutic intervention in metastatic colorectal disease and other 
metastatic cancers. 

SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that are up- and 
down-regulated in metastatic colorectal cancer cells. Such genes and the proteins they 
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encode are useful for diagnostic and prognostic purposes, and also as targets for screening for 
therapeutic compounds that modulate metastatic colorectal cancer, such as antibodies. The 
methods of detecting nucleic acids of the invention or their encoded proteins can be used for 
a number of purposes. Examples include, early detection of colon cancers, monitoring and 
early detection of relapse following treatment of colon cancers, monitoring response to 
therapy of colon cancers, determining prognosis of colon cancers, directing therapy of colon 
cancers, selecting patients for postoperative chemotherapy or radiation therapy, selecting 
therapy, determining tumor prognosis, treatment, or response to treatment, and early 
detection of precancerous colon adenomas. Other aspects of the invention will become 
apparent to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a 
metastatic colorectal cancer-associated transcript in a cell from a patient, die method 
comprising contacting a biological sample from the patient with a polynucleotide that - ~ 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-26. 

In one embodiment, die polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-26. In another embodiment, the 
polynucleotide comprises a sequence as shown in Tables 1-26. 

In one embodiment, the biological sample is a tissue sample. In another 
embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label 

In one embodiment, the polynucleotide is immobilized on a solid surface. 

In one embodiment, the patient is undergoing a therapeutic regimen to treat 
metastatic colorectal cancer. In another embodiment, the patient is suspected of having 
metastatic colorectal cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the method farther comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides methods of detecting 
polypeptide encoded by a metastatic colorectal cancer-associated transcript in a cell from a 
patient, the method comprising contacting a biological sample from the patient with an 
antibody that specifically binds a polypeptide encoded by a sequence at least 80% identical to 
a sequence as shown in Tables 1-26. 
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In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of metastatic colorectal cancer, the method comprising the 
steps of: (i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and (ii) determining the level of a metastatic colorectal cancer-associated transcript 
in the biological sample by contacting the biological sample with a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-26., thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the metastatic colorectal cancer-associated transcript to a level of the metastatic 
colorectal cancer-associated transcript in a biological sample from the patient prior to, or 
earlier in, the therapeutic treatment 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of metastatic colorectal cancer, the method comprising the 
steps of: 0) providing a biological sample from a patient undergoing the therapeutic 
treatment; and (ii) determining the level of a metastatic colorectal cancer-associated antibody 
in die biological sample by contacting the biological sample with a polypeptide encoded by a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-26, wherein the polypeptide specifically binds to the metastatic 
colorectal cancer-associated antibody, thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the metastatic colorectal cancer-associated antibody to a level of die metastatic 
colorectal cancer-associated antibody in a biological sample from the patient prior to, or 
earlier in, the therapeutic treatment 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of metastatic colorectal cancer, the method comprising the 
steps of: (i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and (if) determining the level of a metastatic colorectal cancer-associated 
polypeptide in the biological sample by contacting the biological sample with an antibody, 
* wherein the antibody specifically binds to a polypeptide encoded by a polynucleotide that 

selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-26, thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the metastatic colorectal cancer-associated polypeptide to a level of the metastatic 
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colorectal cancer-associated polypeptide in a biological sample from the patient prior to, or 
earlier in, the therapeutic treatment 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-26. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 

acid 

In one aspect, the present invention provides an isolated polypeptide which is 
encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-26. 

In another aspect, the present invention provides an antibody that specifically 
% binds to an isolated polypeptide which is encoded by a nucleic acid molecule having 
polynucleotide sequence as shown in Tables 1-26. 

, In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. . ; 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized 

In one aspect, the present invention provides a method of detecting a 
metastatic colorectal cancer cell in a biological sample from a patient, the method comprising 
contacting the biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to metastatic colorectal cancer in a patient, the method comprising 
contacting a biological sample from the patient with a polypeptide encoded by a nucleic acid 
comprises a sequence from Tables 1-26. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a metastatic colorectal cancer-associated polypeptide, the method 
comprising the steps of: (i) contacting the compound with a metastatic colorectal cancer- 
associated polypeptide, the polypeptide encoded by a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-26; and 
(ii) determining the functional effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect 

In one embodiment, die polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant 

In one embodiment, die functional effect is determined by measuring ligand 
binding to the polypeptide. 
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In another aspect, the present invention provides a method of inhibiting 
proliferation of a metastatic colorectal cancer-associated cell to treat colorectal cancer in a 
patient, the method comprising the step of administering to the subject a therapeutically 
effective amount of a compound identified as described herein. 

In one embodiment, the compound is an antibody. 

In another aspect, the present invention provides a drag screening assay 
comprising the steps of: (i) adminis tering a test compound to a mammal having colorectal 
cancer or a cell isolated therefrom; (ii) comparing the level of gene expression of a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-26. in a treated cell or mammal with die level of gene expression of the 
polynucleotide in a control cell or mammal, wherein a test compound that modulates the level 
of expression of the polynucleotide is a candidate for the treatment of colorectal cancer. 

In one embodiment, the control is a mammal with colorectal cancer or a cell 
therefrom that has not been treated with the test compound. In another embodiment, the 
control is a normal cell or mammal. 

In another aspect, die present invention provides a method for treating a 
mammal having colorectal cancer comprising administering a compound identified by the 
assay described herein. 

In another aspect, the present invention provides a pharmaceutical 
composition for treating a mammal having colorectal cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient 

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and treatment of colon and/or rectal cancer (e.g., colorectal 
cancer), including metastatic colorectal cancels, as well as methods for screening for 
compositions which modulate colorectal cancer. By "metastatic colorectal cancer" herein is 
meant a colon and/or rectal tumor or cancer that is classified as Dukes stage C or D {see, e.g., 
Cohen et aL, Cancer of the Colon, in Cancer: Principles and Practice of Oncology, pp. 1144- 
1 197 (Devita et a/., eds., 5 th ed. 1997); see also Harrison *s Principles of Interned Medicine, 
pp. 1289-129 (Wilson et aL , eds., 12 th ed, 1991). •Treatment, monitoring, detection or 
modulation of metastatic colorectal cancer" includes treatment, monitoring, detection, or 
modulation of metastatic colorectal disease in those patients who have metastatic colorectal 



6 



WO 02/068677 PCT/US02/06001 

disease (Dukes stage C or D). In Dukes stage A, the tumor has penetrated into, but not 
through, the bowel walL hi Dukes stage B, the tumor has penetrated through the bowel wall 
but there is not yet any lymph involvement In Dukes stage C, the cancer involves regional 
lymph nodes. In Dukes stage D, there is distant metastasis, e.g., liver, lung, etc. 

Tables. 1 -26 provide UniGene cluster identification numbers for the nucleotide 
sequence of genes that exhibit increased or decreased expression in metastasizing colorectal 
cancer samples. Tables 1-26 also provide an exemplar accession number that provides a 
nucleotide sequence that is part of the UniGene cluster. In Tables 1-26, the ratio provided 
represents primary tumor samples from known Dukes B stage survivors vs. liver metastasis 
samples from patients with metastatic colorectal cancer. In these samples, the identified 
genes are underexpressed in the metastatic samples, as the ratio is greater than one, preferably 
1.5 or greater, more preferably 2.0 or greater. In Tables 1-26, the ratio provided represents 
liver metastasis samples from patients with known metastatic colorectal cancer vs. known ~ 
primary tumor samples from Dukes B stage survivors. In these samples, the identified genes 
are overexpressed in the metastatic samples, as the ratio is greater than one, preferably 1.5 or 
greater, more preferably 2.0 or greater. In Tables 1-26, the ratio provided represents primary 
tumor samples from known Dukes B stage survivors vs. liver metastasis samples from 
patients with metastatic colorectal cancer. In these samples, the identified genes are 
overexpressed in the metastatic samples, as the ratio is less than one, preferably 0.5 or less, 
more preferably 0.25 or less. Survivors are subjects who have been disease free for five years 
or longer. 

In Tables 1-26, the ratio provided represents liver metastasis samples from 
patients with known metastatic disease vs. tissue samples from normal colon tissue. In these 
samples, the identified genes are overexpressed in the metastatic samples, as the ratio is 
greater than one, preferably 1 .5 or greater, more preferably 2.0 or greater. In Tables 1-26, the 
ratio represents liver metastasis samples from patients with known metastatic disease vs. 
tissue samples from normal colon tissue. In these samples, the identified genes are 
underexpressed in the metastatic samples, as the ratio is less than one, preferably 0.5 or less, 
more preferably 0.25 or less. 

One of skill will recognize that although the sequences identified in Tables 1- 
26 exhibited increased or decreased expression in metastasizing colorectal cancer samples, 
the sequences of the invention, and their encoded proteins, can be used to diagnose, treat or 
prevent cancers in patients with Dukes stage A or B colorectal cancers. Alteration of gene 
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expression for a gene in Tables 1 -26 may be more likely or less likely to indicate that the 
subject will progress to metastatic disease. The sequences can also be used to diagnose, treat 
or prevent precancerous or benign conditions such as precancerous colon adenomas. 
Alteration of gene expression for a gene in Tables 1-26 may or may not indicate that the 
subject is more likely to progress to cancer or to metastatic disease. Thus, although the 
specification focuses primarily on metastasizing colorectal cancer, the methods described 
below can also be applied to non- metastasizing colorectal cancers {e.g., Dukes stages A and 
B) and precancerous or benign conditions (e.g. 9 precancerous adenomas) as well. 

Definitions 

The term "metastatic colorectal cancer protein" or "metastatic colorectal 
cancer polynucleotide" or "metastatic colorectal cancer-associated transcript" refers to 
nucleic acid and polypeptide polymorphic variants, alleles, mutants, and interspecies 
homologs that: (1) have a nucleotide sequence that has greater than about 60% nucleotide 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98% or 99% or greater nucleotide sequence identity, preferably over a region of 
over a region of at least about 25, 50, 100, 200, 500, 1000, or more nucleotides, to a 
nucleotide sequence of or associated with a UniGene cluster of Tables 1-26; (2) bind to 
antibodies, e.g., polyclonal antibodies, raised against an immunogen comprising an amino 
acid sequence encoded by a nucleotide sequence of or associated with a UniGene cluster of 
Tables 1-26, and conservatively modified variants thereof; (3) specifically hybridize under 
stringent hybridization conditions to a nucleic acid sequence, or the complement thereof of 
Tables 1-26 and conservatively modified variants thereof or (4) have an amino acid sequence 
that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%, 80%, 85%, 
90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater amino 
sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 
500, 1000, or more amino acid, to an amino acid sequence encoded by a nucleotide sequence 
of or associated with a UniGene cluster of Tables 1-26. A polynucleotide or polypeptide 
sequence is typically from a mammal including, but not limited to, primate, e.g., human; 
rodent, e.g., rat, mouse, hamster, cow, pig, horse, sheep, or other mammal. A "metastatic 
colorectal cancer polypeptide" and a Metastatic colorectal cancer polynucleotide," include 
both naturally occurring or recombinant 
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A "full length" metastatic colorectal cancer protein or nucleic acid refers to a 
metastatic colorectal cancer polypeptide or polynucleotide sequence, or a variant thereof; that 
contains all of the elements normally contained in one or more naturally occurring, wild type 
metastatic colorectal cancer polynucleotide or polypeptide sequences. The "full length" may 
be prior to, or after, various stages of post-translation processing or splicing, including 
alternative splicing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of a metastatic colorectal cancer protein, 
polynucleotide or transcript Such samples include, but are not limited to, tissue isolated 
from primates, e.g., humans, or rodents, e.g., mice, and rats. Biological samples may also 
include sections of tissues such as biopsy and autopsy samples, frozen sections taken for 
histologic purposes, blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. 
Biological samples also include explants and primary and/or transformed cell cultures ~- 
derived from patient tissues. A biological sample is typically obtained from a eukaiyotic 
organism, most preferably a mammal such as a primate, e.g., chimpanzee or human; cow; 
dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or other mammal; or a bird; reptile; 
fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly usefiiL 

The terns "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (ie., about 60% identity, preferably 70% 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, eg., NCBI 
web site http://wwwjicbi.nlm .nib .gov/BLAST/ or the like). Such sequences are thai said to 
be "substantially identical" This definition also refers to, or may be applied to, the 
compliment of a test sequence. The definition also includes sequences that have deletions 
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and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
algorithms can account for gaps and the like. Preferably, identity exists over a region that is 
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
50-100 amino acids or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
one of the number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned Methods of alignment of sequences 
for comparison are well-known in the art Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl 
Math 2'Ail (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol 
Biol 48 :443 (1 970), by the search for similarity method of Pearson & Lipman, Proc Nat 1 
Acad ScL USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFTT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Gaieties Computer Group, 575 Science Dr., Madison, WT), or by manual alignment and 
visual inspection (see, eg., Current Protocols in Molecular Biology (Ausubel et al. 9 eds. 
1995 supplement)). 

Preferred examples of algorithms that are suitable for determining percent 
, sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 
which are described in Altschul et al, Nua Acids Res. 25:3389-3402 (1977) and Altschul et 
al, J. Mol. Biol 215:403-410 (1990). BLAST and BLAST 2.0 are-used, with the parameters 
described herein, to determine percent sequence identity for the nucleic acids and proteins of 
the invention. Software for performing BLAST analyses is publicly available through the 
National Center for Biotechnology Information (http:/Avww Jicbi jdmnih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
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words of length Win the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. Tis 
referred to as (he neighborhood word score threshold (Altschul et aL, supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are extended in both directions along each sequence for as fir as the 
cumulative alignment score can be increased Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction are halted when: the cumulative alignment score fills off by the 
quantity X from its maximum achieved value; die cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached The BLAST algorithm parameters W, T, and X determine the ~ 
sensitivity and speed of the alignment The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=^4 and a 
comparison of both strands. For amino acid sequences, the BLAST? program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Hemkofl^ Proa Natl Acad. ScL USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of die similarity 
between two sequences (see, ag., Kariin & Altschul, Proa Nat T Acad. ScL USA 90:5873- 
5787 (1993)). One measure of similarity provided by die BLAST algorithm is die smallest 
sum probability (P(N))> which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 
nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001 . Log values may 
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against die polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 
indication that two nucleic acid sequences are substantially identical is that the two molecules 
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or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences axe substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli> or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as GHO, HeLa, and the like (see, e.g„ the American Type Culture 
Collection catalog or web site, www.atcc.org). 

The terms "isolated," "purified," or biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
.techniques such as polyaciylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term 4 'purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 
most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 
removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogenous, e.g., 100% pure. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those 
containing modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
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norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 
modified R groups (e.g., norleucrine) or modified peptide backbones, but retain the same basic 
chemical structure as a naturally occurring amino acid. Amino add mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that functions similarly to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 
acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 
most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino 
add alanine. Thus, at every position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acrid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a 
polypeptide also describes silent variations of the nucleic acid. One of skill will recognize 
that in certain contexts each codon in a nucleic acrid (except AUG, which is ordinarily the 
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can 
be modified to yield a functionally identical molecule. Accordingly, often silent variations of 
a nucleic acrid which encodes a polypeptide is implicit in a described sequence with respect to 
the expression product, but not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acrid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative sub stitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homology and alleles of the invention. 
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The following eight groups each contain amino acids that are typically 
conservative substitutions for one another. 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), 
Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Argitime (R), Lysine (K); 5) 
Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), 
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, 
Creighton, Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e.g, Alberts et al 9 Molecular Biology of the Cell (3 rd ed., 1994) and Cantor & Schimmel, 
Biophysical Chemistry Pari I: The Conformation of Biological Macromolecules (1980). 
'Trimaiy structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that, f 
often form a compact unit of the polypeptide and are typically 25 to approximately 500 
amino acids long. Typical domains are made up of sections of lesser organization such as 
stretches of p-sheet and a-helices. "Tertiary structure" refers to the complete three 
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three 
dimensional structure formed, usually by die noncovalent association of independent tertiary 
units. Anisotropic terms are also known as energy terms. 

"Nucleic acid" or "oligonucleotide" or polynucleotide" or grammatical 
equivalents used herein means at least two nucleotides covalently linked together. 
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more 
nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and 
polynucleotides are a polymers of any length, including longer lengths, e.g., 200, 300, 500, 
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will 
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
included that may have alternate backbones, comprising, e.g., phosphoramidate, 
phosphorothioate, phosphorodithioate, or Omethylphophoroamidite linkages (see Eckstein, 
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and 
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with 
positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 
Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui & 
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Cook, eds.. Nucleic acids containing one or more carbocyclic sugars are also included within 
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stability and half-life of such molecules in 
physiological environments or as probes on a biochip. Mixtures of naturally occurring 
nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (T^ for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 
due to their non-ionic nature, hybridization of the bases attached to these backbones is 
relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 
by those in the art, the depiction of a single strand also defines the sequence of the 
complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic add may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 
xanthine hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a 
naturally occurring RNA, e.g., apre-mRNA, hnRNA, or mRNA As used herein, the tern 
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g. the individual units of a peptide nucleic 
acid, each containing abase, are referred to herein as a nucleoside. 

A "laber or a "detectable moiety** is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means. For example, useful labels include 32 P 9 fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
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or other entities which can be made detectable, e.g., by incorporating a radiolabel into the 
peptide or used to detect antibodies specifically reactive with the peptide. 

An "effector" or "effector moiety" or "effector component" is a molecule that 
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "effector" can be a variety of molecules including, e.g., detection moieties including 
radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method _~ 
using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic add capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not 
functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in 
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. 
It will b e understood by one of skill in the art that probes may bind target sequences lacking 
complete complementarity with the probe sequence depending upon the stringency of the 
hybridization conditions. The probes are preferably directly labeled as with isotopes, 
chromophobes, lumiphores, chromogens, or indirectly labeled such as with biotin to which a 
streptavidin complex may later bind. By assaying for the presence or absence of the probe, 
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or 
prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

The term "recombinant** when used with reference, e.g., to a cell, or nucleic 
acid, protein, or vector^ indicates that the cell, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alteation of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., 
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recombinant cells express genes that are not found within the native (non-recombinant) form 
of the cell or express native genes that are otherwise abnormally expressed, under expressed 
or not expressed at all. By the term "recombinant nucleic acid** herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner, 
operably linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinant^, although subsequently replicated non-recombinantly, are still considered 
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid As used herein, a promoter includes necessary nucleic 
acid sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element A promote- also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
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wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantiy or 
synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host celL The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture of ~~ 
nucleic acids, but to essentially no other sequences. Stringent conditions are sequence- 
dependent and will be different in different circumstances. Longer sequences hybridize 
specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids 
is found in Tijssen, Techniques in Biochemistry and Molecular Biology— Hybridization with 
Nucleic Probes, "Overview of principles of hybridization and the strategy of nucleic acid 
assays" (1993). Generally, stringent conditions are selected to be about 5-10°C lower than 
the thermal melting point (T^) for the specific sequence at a defined ionic strength pH. The 
T m is the temperature (undo: defined ionic strength, pH, and nucleic concentration) at which 
50% of the probes complementary to the target hybridize to the target sequence at 
equilibrium (as the target sequences are present in excess, at 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or 
other salts) at pH 7.0 to 83 and the temperature is at least about 30°C for short probes (e.g., 
10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. For selective or specific hybridization, a positive signal is at least 
two times background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions are often: 50% formamide, 5x SSC, and 1% SDS, incubating at 
42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 0.1% SDS at 
65°C. For PGR, a temperature of about 36°C is typical for low stringency amplification, 



18 



WO 02/068677 



PCT/DS02/06001 



although annealing temperatures may vary between about 32°C and 48°C depending on 
prima- length. For high stringency PCR amplification, a temperature of about 62°C is 
typical, although high stringency annealing temperatures can range from about 5 0°C to about 
65 °C, depending on the primer length and specificity. Typical cycle conditions for both high 
and low stringency amplifications include a denaturation phase of 90°C - 95°C for 30 sec - 2 
min, an annealing phase lasting 30 sec. - 2 min, and an extension phase of about 72°C for 1 - 
2 min. Protocols and guidelines for low and high stringency amplification reactions are 
provided, e.g., in Tunis et al. 9 PCR Protocols, A Guide to Methods and Applications (1990). 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical 
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary '^moderately stringent " - 
hybridization conditions'* include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37°C, and a wash in IX SSC at 45°C. A positive hybridization is at least twice 
background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et al. 

The phrase 'functional effects" in the context of assays for testing compounds 
that modulate activity of a metastatic colorectal cancer protein includes the determination of a 
parameter that is indirectly or directly under the influence of the metastatic colorectal cancer 
protein or nucleic acid, e.g., an enzymatic, functional, physical, or chemical effect, such as 
the ability to decrease metastatic colorectal cancer. It includes ligand binding activity; cell 
growth on soft agar; anchorage dependence; contact inhibition and density limitation of 
growth; cellular proliferation; cellular transformation; growth factor or serum dependence; 
tumor specific marker levels; invasiveness into Matrigel; tumor growth and metastasis in 
vivo; mRNA and protein expression in cells undergoing metastasis, and other characteristics 
of metastatic colorectal cancer cells. "Functional effects" include in vitro, in vivo, and ex 
vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a 
metastatic colorectal cancer protein sequence, e.g., functional, enzymatic, physical and 
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chemical effects. Such functional effects can be measured by any means known to those 
skilled in the art, e.g., changes in spectroscopic characteristics (e.g., fluorescence, 
absorb ance, refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility 
properties for the protein, measuring inducible markers or transcriptional activation of the 
metastatic colorectal cancer protein; measuring binding activity or binding assays, e.g., 
binding to antibodies or other ligands, and measuring cellular proliferation. Determination of 
the functional effect of a compound on metastatic colorectal cancer can also be performed 
using metastatic colorectal cancer assays known to those of skill in the art such as an in vitro 
assays, e.g., cell growth on soft agar; anchorage dependence; contact inhibition and density 
limitation of growth; cellular proliferation; cellular transformation; growth factor or serum 
dependence; tumor specific marker levels; invasiveness into Matrigel; tumor growth and 
metastasis in vivo; mRNA and protein expression in cells undergoing metastasis, and other 
characteristics of metastatic colorectal cancer cells. The functional effects can be evaluated 
by many means known to those skilled in the art, e.g., microscopy for quantitative or 
qualitative measures of alterations in morphological features, measurement of changes in 
RNA or protein levels for metastatic colorectal cancer-associated sequences, measurement of 
RNA stability, identification of downstream or reporter gene expression (CAT, luciferase, [J- 
gal, GFP and the like), ag., via chemUuminescence, fluorescence, colorimetric reactions, 
antibody binding, inducible markers, and ligand binding assays. 

"Inhibitors", "activators", and ''modulators" of metastatic colorectal cancer 
polynucleotide and polypeptide sequences are used to refer to activating, inhibitory, or 
modulating molecules or compounds identified using in vitro and in vivo assays of metastatic 
colorectal cancer polynucleotide and polypeptide sequences of the invention. Inhibitors are 
compounds that, e.g^ bind to, partially or totally block activity, decrease, prevent, delay 
activation, inactivate, desensitize, or down regulate the activity or expression of metastatic 
colorectal cancer proteins of the invention, e.g., antagonists. Antisense nucleic acids may 
seem to inhibit expression and subsequent function of die pro "Activators" are 
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 
up regulate metastatic colorectal cancer protein activity. Inhibitors, activators, or modulators 
also include genetically modified versions of metastatic colorectal cancer proteins, e.g., 
versions with altered activity, as well as naturally occurring and synthetic ligands, 
antagonists, agonists, antibodies, small chemical molecules and the like. Such assays for 
inhibitors and activators include, e.g., expressing the metastatic colorectal cancer protein in 
vitro, in cells, or cell membranes, applying putative modulator compounds, and then 
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determining the functional effects cm activity, as described above. Activators and inhibitors 
of metastatic colorectal cancer can also be identified by incubating metastatic colorectal 
cancer cells with the test compound and determining increases or decreases in the expression 
of 1 or more metastatic colorectal cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 
or more metastatic colorectal cancer proteins, such as colorectal cancer proteins encoded by 
the sequences set out in Tables 1-26. 

Samples or assays comprising metastatic colorectal cancer proteins that are 
treated with a potential activator, inhibitor, or modulator are compared to control samples 
without the inhibitor, activator, or modulator to examine the extent of inhibition. Control 
samples (untreated with inhibitors) are assigned a relative protein activity value of 100%. 
Inhibition of a polypeptide is achieved when the activity value relative to the control is about 
80%, preferably 50%, more preferably 25-0%. Activation of a metastatic colorectal cancer 
polypeptide is achieved when the activity value relative to the control (untreated with _ ~ 
activators) is 1 10%, more preferably 150%, more preferably 200-500% (i.e., two to five fold 
higher relative to the control), more preferably 1000-3000% higher. 

The phrase "changes in cell growth" refers to any change in cell growth and 
proliferation characteristics in vitro or in vivo, such as formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell See, eg., Freshney, Culture of Animal Cells a Manual of Basic 
Technique pp. 231-241 (3 rd ed 1994). 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers 
to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of 
new genetic material. Although transformation can arise fiom infection with a transforming 
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney, 
Culture of Animal Cells a Manual of Basic Technique (3 rd ed. 1994)). 

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
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The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 
Fundamental Immunology. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "lighf * (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids primarily 
responsible for antigen recognition. The terms variable light chain (V l) and variable heavy 
chain (Vh) refer to these light and heavy chains respectively. _ ? 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab')2, a 
dimer of Fab which itself is a ligjit chain joined to Vh-Ch1 by a disulfide bond. The F(ab l ) 2 
may be reduced under mild conditions to break the disulfide linkage in the hinge regi on, 
thereby converting the F(ab r ) > 2 dimer into an Fab' monomer. The Fab' monomer is 
essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terns of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 
herein, also includes antibody fragments either produced by the modification of whole 
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries (see, eg, McCafferty et aL y Nature 
348:552-554(1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many technique known in the art can be used (see, e.g., Kohler & Milstein, 
Nature 256:495-497 (1975); Kozbor et al y Immunology Today 4:72 (1983); Cole et al., pp. 
77-96 in Monoclonal Antibodies and Cancer Therapy (1985); Coligan, Current Protocols in 
Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Goding, 
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the 
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce 
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antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such 
as other mammals, may be used to express humanized antibodies. Alternatively, phage 
display technology can be used to identify antibodies and heteromeric Fab fragments that 
specifically bind to selected antigens {see, e.g. 3 McCafierty et aL y Nature 348:552-554 
(1990); Maries et dL> Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which, e.g, (a) the constant 
region, or a portion thereof; is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector function 
and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

Identification of metastatic colorectal cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 
characteristic of the state of the cell. That is, normal tissue may be distinguished from 
cancerous or metastatic cancerous tissue, or metastatic cancerous tissue can be compared 
with tissue from surviving cancer patients. By comparing expression profiles of tissue in 
known different metastatic colorectal cancer states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 
obtained. 

The identification of sequences that are differentially expressed in metastatic 
colorectal cancer versus non-metastatic colorectal cancer tissue allows the use of this 
information in a number of ways. For example, a particular treatment regime may be 
evaluated: does a chemotherapeutic drag act to down-regulate metastatic colorectal cancer, 
and thus tumor growth or recurrence, in a particular patient Similarly, diagnosis and 
treatment outcomes may be done or confirmed by comparing patient samples with the known 
expression profiles. Metastatic tissue can also be analyzed to determine the stage of 
metastatic colorectal cancer in the tissue. Furthermore, these gene expression profiles (or 
individual genes) allow screening of drug candidates with an eye to mimicking or altering a 
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particular expression profile; e.g., screening can be done for drugs that suppress the 
metastatic colorectal cancer expression profile. This may be done by making biochips 
comprising sets of the important metastatic colorectal cancer genes, which can then be used 
in these screens. PCR methods may be applied with selected primer pairs, and analysis may 
be of RNA or of genomic sequences. These methods can also be done on the protein basis; 
that is, protein expression levels of the metastatic colorectal cancer proteins can be evaluated 
for diagnostic purposes or to screen candidate agents. In addition, the metastatic colorectal 
cancer nucleic acid sequences can be administered for gene therapy purposes, including the 
administration of antisense nucleic acids, or the metastatic colorectal cancer proteins 
(including antibodies and other modulators thereof) administered as therapeutic drugs or as 
protein or DNA vaccines. 

Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in metastatic colorectal cancer, herein termed "metastatic 
colorectal cancer sequences." As outlined below, metastatic colorectal cancer sequences 
include those that are up-regulated (i.e., expressed at a higher level) in metastatic colorectal 
cancer, as well as those that are down-regulated (i.e., expressed at a lower level). In a 
preferred embodiment, the metastatic colorectal cancer sequences are from humans; however, 
as will be appreciated by those in the art, metastatic colorectal cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other 
metastatic colorectal cancer sequences are provided, from vertebrates, including mammals, 
including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals (including 
sheep, goats, pigs, cows, horses, etc.) and pets (dogs, cats, etc.). Metastatic colorectal cancer 
sequences from other organisms may be obtained using the techniques outlined below. 

Metastatic colorectal cancer sequences can include both nucleic acid and 
amino acid sequences. As will be appreciated by those in the art and is more fully outlined 
below, metastatic colorectal cancer nucleic add sequences are useful in a variety of 
applications, including diagnostic applications, which will detect naturally occurring nucleic 
acids, as well as screening applications; e.g., biochips comprising nucleic acid probes or PCR 
microtiter plates with selected probes to the metastatic colorectal cancer sequences can be 
generated 

A metastatic colorectal cancer sequence can be initially identified by 
substantial nucleic acid and/or amino acid sequence homology to the metastatic colorectal 
cancer sequences outlined herein. Such homology can be based upon the overall nucleic acid 
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or amino acid sequence, and is generally determined as outlined below, using either 
homology programs or hybridization conditions. 

For identifying metastatic colorectal cancer-associated sequences, die 
metastatic colorectal cancer screen typically includes comparing genes identified in different 
tissues, e.g., normal and cancerous tissues, or tumor tissue samples from patients who have 
metastatic disease vs. non metastatic tissue, or tumor tissue samples from patients who have 
been diagnosed with Dukes stage A or B cancer but have survived vs. metastatic tissue. 
Other suitable tissue comparisons include comparing metastatic colorectal cancer samples 
with metastatic cancer samples from other cancers, such as lung, breast, other gastrointestinal 
cancers, prostate, ovarian, etc. Samples o£ e.g., Dukes stage B survivor tissue and tissue 
undergoing metastasis are applied to biochips comprising nucleic acid probes. The samples 
are first microdissected, if applicable, and treated as is known in the art for the preparation of 
mRNA. Suitable biochips are commercially available, e.g., from Afiymetrix. Gene .? 
expression profiles as described herein are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between 
normal and disease states are compared to genes expressed in other normal tissues, preferably 
normal colon, but also including, and not limited to lung, heart, brain, liver, breast, kidney, 
muscle, prostate, small intestine, large intestine, spleen, bone and placenta. In a preferred 
embodiment, those genes identified during the metastatic colorectal cancer screen that are 
expressed in significant amounts in other tissues are removed from the profile, although in 
some embodiments, this is not necessary. That is, when screening for drugs, it is usually 
preferable that the target be disease specific, to minimize possible side effects. 

In a preferred embodiment, metastatic colorectal cancer sequences are those 
that are up-regulated in metastatic colorectal cancer; that is, the expression of these genes is 
higher in the metastatic tissue as compared to non-metastatic cancerous tissue or normal 
colon tissue (see, eg., Tables 1-26). "Up-regulation" as used herein means, when the ratio is 
presented as a number greater than one, that the ratio is greater than one, preferably 1.5 or 
greater, more preferably 2.0 or greater. All UniGene cluster identification numbers and 
accession numbers herein are for the GenBank sequence database and the sequences of the 
accession numbers are hereby expressly incorporated by reference. GenBank is known in the 
art, see, eg., Benson, DA, et aL, Nucleic Acids Research 26:1-7 (1998) and 
http:ZAvwwjicbi.nIm.nih.gov/. Sequences are also available in other databases, e.g., 
European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
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In another preferred embodiment, metastatic colorectal cancer sequences are 
those that are down-regulated in the metastatic colorectal cancer, that is, the expression of 
these genes is lower in metastatic tissue as compared to non-metastatic cancerous tissue or 
normal colon tissue {see, e.g., Tables 1-26). "Down-regulation" as used herein means, when 
the rati o i s presented as a number greater than one, that the ratio is greater than one, 
preferably 1.5 or greater, more preferably 2.0 or greater, or, when the ratio is presented as a 
number less than one, that the ratio is less than one, preferably 0.5 or less, more preferably 
0.25 or less. 

Informatics 

The ability to identify genes that are over or under expressed in metastatic 
colorectal cancer can additionally provide high-resolution, high-sensitivity datasets which can 
be used in the areas of diagnostics, therapeutics, drug development, pharmacogenetics, _ ; 
protein structure, biosensor development, and other related areas. For example, the 
expression profiles can be used in diagnostic or prognostic evaluation of patients with 
metastatic colorectal cancer. Or as another example, subcellular toxicological information 
can be generated to better direct drug structure and activity correlation (see Anderson, 
Pharmaceutical Proteomics: Targets, Mechanism, and Function, paper presented at the BBC 
Proteomics conference, Coronado, CA (June 1 1-12, 1998)). Subcellular toxicological 
information can also be utilized in a biological sensor device to predict the likely 
toxicological effect of chemical exposures and likely tolerable exposure thresholds (see U.S. 
Patent No. 5,811,231). Similar advantages accrue from datasets relevant to other 
biomolecules and bioactive agents (e.g., nucleic acids, saccharides, lipids, drugs, and the 
like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a library format The database can be in substantially 
any form in which data can be maintained and transmitted, but is preferably an electronic 
database. The electronic database of the invention can be maintained on any electronic 
device allowing for the storage of and access to the database, such as a personal computer, 
but is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for assay data acquired using an assay of the invention. 



26 



WO 02/068677 



PCT7US02/06001 



The compositions and methods for identifying and/or quantitating the relative 
and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing metastatic colorectal cancer, Le., the identification of 
metastatic colorectal cancer-associated sequences described herein, provide an abundance of 
information, which can be correlated with pathological conditions, predisposition to disease, 
drug testing, therapeutic monitoring, gene-disease causal linkages, identification of correlates 
of immunity and physiological status, among others. Although the data generated from the 
assays of the invention is suited for nian ^ al review and analysis, in a preferred embodiment, 
prior data processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is 
known in the art For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 
database system for storing biomolecular sequence information in a manner ihat allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 
for obtaining full-length sequences from the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify ammo acid, sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 

See also Mount et aL, Bioinformatics (2001); Biological Sequence Analysis: 
Probabilistic Models of Proteins and Nucleic Adds (Durbin et aL, eds., 1999); 
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis & 
Oeullette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological 
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Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et 
al 9 eds 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds, 2000); 
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins & 
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist's Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and 
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995). 

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing 
sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for metastatic colorectal cancer. 
In another variation, the assay records cross-tabulate one or more of the following parameters 
for each target species in a sample: (1) a unique identification code, which can include, e.g., a 
target molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. 

The invention also provides for the storage and retrieval of a collection of 
target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 
transistor gate states, such as an array of cells in a DRAM device (e.g. , each cell comprised of 
a transistor and a charge storage area, which maybe on the transistor), hi one embodiment, 
file invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides 
a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
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comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (eg., FASTA, TFASTA, GAP, BESTFTT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

Hie invention also preferably provides a magnetic disk, such as an IBM- 
compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 
(e.g., Linux, SunOS, Solaris, ADC, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing 
devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium, ~ 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 
initialized to load and execute the compute: program for alignment and/or comparison of the 
assay results. Data for a quay target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result 

The target data or record and the computer program can be transferred to 
secondary memory, winch is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
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a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 
PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, eta); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an VO device can be a terminal comprising a video display and a keyboard, a modem, 
an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable VO device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer, (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer, (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

Characteristics of metastatic colorectal cancer-associated proteins 

Metastatic colorectal cancer proteins of the present invention may be classified 
as secreted proteins, transmembrane proteins or intracellular proteins. In one embodiment, 
the metastatic colorectal cancer protein is an intracellular protein. Intracellular proteins may 
be found in the cytoplasm and/or in the nucleus and/or in the organelles. Proteins containing 
one or more transmembrane domains that exclusively reside in organelles are also considered 
intracellular proteins. Intracellular proteins are involved in all aspects of cellular function 
and replication (including, e.g., signaling pathways); aberrant expression of such proteins 
often results in unregulated or disregulated cellular processes (see, e g., Molecular Biology of 
the Cell (Alberts, eA, 3rd ed, 1994). For example, many intracellular proteins have 
enzymatic activity such as protein kinase activity, protein phosphatase activity, protease 
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins 
also serve as docking proteins that are involved in organizing complexes of proteins, or 
targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 
in the proteins of one or more motifs for which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
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interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylaied 
targets in a sequence dependent manner. PTB domains, which are distinct ftom SH2 
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 
targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman et al, Nuc J 
Acids Res. 28:263-266 (2000); Sonnhammer et al, Proteins 28:405-420 (1997); Bateman et 
aL, Nuc. Adds Res. 27:260-262 (1999); and Sonnhammer et al., Nuc. Acids Res. 26:320-322- 
(1998)). 

In another embodiment, the metastatic colorectal cancer sequences are 
transmembrane proteins. Transmembrane proteins are molecules that span a phospholipid 
bilayer of a cell. They may have an intracellular domain, an extracellular domain, or both. 
The intracellular domains of such proteins may have a number of functions including those 
already described for intracellular proteins. For example, the intracellular domain may have 
enzymatic activity and/or may serve as a binding site for additional proteins. Frequently the 
intracellular domain of transmembrane proteins serves both roles. For example certain 
receptor tyrosine kinases have both protein kinase activity and SH2 domains. In addition, 
autophosphorylation of tyrosines on the receptor molecule itself creates binding sites for 
additional SH2 domain containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 
guanylyl cyclases and receptor serine/threonine protein kinases contain a single 
transmembrane domain. However, various other proteins including channels, pumps, and 
adenylyl cyclases contain numerous transmembrane domains. Many important cell surface 
receptors such as G protein coupled receptors (GPCRs) are classified as "seven 
transmembrane domain" proteins, as they contain 7 membrane spanning regions. 
Characteristics of transmembrane domains include a ppr o xim ately 20 consecutive 



31 



WO 02/068677 PCT/US02/06001 

hydrophobic amino acids that may be followed by charged amino acids. Therefore, upon 
analysis of the amino acid sequence of a particular protein, the localization and number of 
transmembrane domains within the protein may be predicted (see, e.g. PSORT web site 
http://psoitJiibb.ac.jp/). 

The extracellular domains of transmembrane proteins are diverse; however, 
conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or functions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
cytokines, mitogenic factors, hormones, neurotrophic factors and the like. Extracellular - - 
domains also bind to cell-associated molecules. In this respect, they mediate cell-cell 
interactions. Cell-associated ligands can be tethered to the cell, e.g., via a 
glycosylphosphatidylinositol (GET) anchor, or may themselves be transmembrane proteins. 
Extracellular domains also associate with the extracellular matrix and contribute to the 
maintenance of the cell structure. 

Metastatic colorectal cancer proteins that are transmembrane are particularly 
preferred in the present invention as they are readily accessible targets for extracellular 
immunotherapeutics, as are described herein, hi addition, as outlined below, transmembrane 
proteins can be also useful in imaging modalities. Antibodies may be used to label such 
readily accessible proteins in situ or in histological analysis. Alternatively, antibodies can 
also label intracellular proteins, in which case analytical samples are typically permeablized 
to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 
be made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the metastatic colorectal cancer proteins are secreted 
proteins; the secretion of which can be either constitutive or regulated. These proteins have a 
signal peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous physiological events; by virtue of their circulating nature, 
they often serve to transmit signals to various other cell types. The secreted protein may 
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function in an autocrine manner (acting on the cell that secreted the factor), a paracrine 
manner (acting on cells in close proximity to the cell that secreted the factor) or an endocrine 
manner (acting on cells at a distance). Thus secreted molecules find use in modulating or 
altering numerous aspects of physiology. Metastatic colorectal cancer proteins that are 
secreted proteins are particularly preferred in the present invention as they serve as good 
targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests. 

Use of metastatic colorectal cancer nucleic adds 

As described above, metastatic colorectal cancer sequence is initially 
identified by substantial nucleic acid and/or amino acid sequence homology or linkage to the 
metastatic colorectal cancer sequences outlined herein. Such homology can be based upon 
the overall nucleic acid or amino acid sequence, and is generally determined as outlined 
below, using either homology programs or hybridization conditions. Typically, linked _ ~" 
sequences on a mRNA are found on the same molecule. 

The metastatic colorectal cancer nucleic acid sequences of the invention, e.g., 
the sequences in Tables 1-26, can be fragments of larger genes, i.e., they are nucleic acid 
segments. "Genes" in this context includes coding regions, non-coding regions, and mixtures 
of coding and non-coding regions. Accordingly, as will be appreciated by those in the art, 
using the sequences provided herein, extended sequences, in either direction, of the 
metastatic colorectal cancer genes can be obtained, using techniques well known in the art for 
cloning either longer sequences or the full length sequences; see Ansubel, et aL , supra. 
Much can be done by informatics and many sequences can be clustered to include multiple 
sequences corresponding to a single gene, e.g., systems such as UniGene (see, 
http://www jncbi.nlm.nih.gov/unigene/). 

Once die metastatic colorectal cancer nucleic acid is identified, it can be 
cloned and, if necessary, its constituent parts recombined to form the entire metastatic 
colorectal cancer nucleic acid coding regions or the entire mRNA sequence. Once isolated 
from its natural source, e.g., contained within a plasmid or other vector or excised therefrom 
as a linear nucleic acid segment, the recombinant metastatic colorectal cancer nucleic acid 
can be further-used as a probe to identify and isolate other metastatic colorectal cancer 
nucleic acids, e.g., extended coding regions. It can also be used as a "precursor" nucleic acid 
to make modified or variant metastatic colorectal cancer nucleic acids and proteins . 

The metastatic colorectal cancer nucleic acids of die present invention are 
used in several ways. In a first embodiment, nucleic acid probes to the metastatic colorectal 
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cancer nucleic acids are made and attached to biochips to be used in screening and diagnostic 
methods, as outlined below, or for administration, e.g., for gene therapy, vaccine, and/or 
antisense applications. Alternatively, the metastatic colorectal cancer nucleic adds that 
include coding regions of metastatic colorectal cancer proteins can be put into expression 
vectors for the expression of metastatic colorectal cancer proteins, again for screening 
purposes or for administration to a patient 

In a preferred embodiment, nucleic acid probes to metastatic colorectal cancer 
nucleic acids (both the nucleic acid sequences outlined in the figures and/or the complements 
thereof) are made. The nucleic acid probes attached to the biochip are designed to be 
substantially complementary to the metastatic colorectal cancer nucleic acids, i.e. the target 
sequence (either the target sequence of the sample or to other probe sequences, e.g., in 
sandwich assays), such that hybridization of the target sequence and the probes of the present 
invention occurs. As outlined below, this complementarity need not be perfect; there may.be 
any number of base pair mismatches which will interfere with hybridization between die 
target sequence and the single stranded nucleic acids of the present invention. However, if 
the number of mutations is so great that no hybridization can occur under even the least 
stringent of hybridization conditions, the sequence is not a complementary target sequence. 
Thus, by "substantially complementary" herein is meant that the probes are sufficiently 
complementary to the target sequences to hybridize under appropriate reaction conditions, 
particularly high stringency conditions, as outlined herein. 

A nucleic acid probe is generally angle stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally 
complements of ORFs or whole genes are not used. In some embodiments, nucleic acids of 
lengths up to hundreds of bases can be used. 

In a preferred embodiment, more than one probe per sequence is used, with 
either overlapping probes or probes to different sections of the target being used. That is, 
two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target The probes can be overlapping (i.e., have some sequence in common), 
or separate. la some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
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equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent By "non- 
covalent binding" and grammatical equivalents herein is typically meant one or more of 
electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent blading is 
the covalent attachment of a molecule, such as, strep tavidin to the support and the non- 
covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and 
grammatical equivalents herein is meant that the two moieties, the solid support and the 
probe, are attached by at least one bond, including sigma bonds, pi bonds and coordination 
bonds. Covalent bonds can be formed directly between the probe and the solid support or can 
be formed by a cross linker or by inclusion of a specific reactive group on either the solid . 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to a biochip in a wide variety of ways, as 
will be appreciated by those in the art As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

jrbe biochip comprises a suitable solid substrate. By "substrate" or "solid 
support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method As will be appreciated by those 
in the art, die number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 
polyurethanes, Teflon, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. A preferred substrate is described in copending application entitled Reusable Low 
Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 1999, 
herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as welL For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 
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sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 
the biochip is derivatized with a chemical functional group including, but not limited to, 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g., 
homo-or hetero-bifunctional linkers as are well known {see 1994 Pierce Chemical Company 
catalog, technical section on cross-linkers, pages 1 55-200). In addition, in some cases, 
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 
and then attached to the surface of the solid support As will be appreciated by those skilled 
in the art, either the 5' or 3' terminus maybe attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent For example, biotinylated oligonucleotides can be made, which 
bind to surfaces covalently coated with streptavidin, resulting in attachment 

Alternatively, die oligonucleotides may be synthesized on the surface, as is 
known in the art For example, photoacti vation techniques utilizing photopolymerization 
compounds and techniques are used, hi a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 
in WO 95/25 1 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references" 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of metastatic colorectal cancer-associated sequences. These assays are typically 
performed in conjunction with reverse transcription. In such assays, a metastatic colorectal 
cancer-associated nucleic acid sequence acts as a template in an amplification reaction (e.g., 
Polymerase Chain Reaction, or PCR). In a quantitative amplification, the amount of 
amplification product will be proportional to the amount of template in the original sample. 
Comparison to appropriate controls provides a measure of the amount of metastatic colorectal 
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cancer-associated RNA. Methods of quantitative amplification are well known to those of 
skill in the art Detailed protocols for quantitative PCR are provided, e.g., in hmis et aL, PCR 
Protocols, A Guide to Methods and Applications (1990). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
dye and a 3' quenching agent The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5* fluorescent dye and the 3' 
quenching agent, thereby resulting in an increase in fluorescence as a function of 
amplification (see, e.g., hterature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) (see Wu & Wallace, Genomics 4:560 (1989), Landegren et aL, Science 
241:1077 (1988), and Barringer et aL, Gene 89: 1 17 (1990)), transcription amplification 
(Kwoh et aL, Proc Natl Acad ScL USA 86:1173 (1989)), self-sustained sequence replication 
(Guatelli et aL, Proc Nat. Acad. ScL USA 87:1874 (1990)), dot PCR, and linker adapter PCR, 
etc. 

Expression of metastatic colorectal cancer proteins from nucleic acids 

In a preferred embodiment, metastatic colorectal cancer nucleic acids, e.g., 
encoding metastatic colorectal cancer proteins, are used to make a variety of expression 
vectors to express metastatic colorectal cancer proteins which can then be used in screening 
assays, as described below. Expression vectors and recombinant DNA technology are well 
known to those of skill in die art (see, eg., Ansubel, sitpra, and Gene Expression Systems 
(Fernandez & Hoeffler, eds, 1999)) and are used to express proteins. The expression vectors 
may be either self-replicating extracfaromosomal vectors or vectors which integrate into a 
host genome. Generally, these expression vectors include transcriptional and translational 
regulatory nucleic acid operably linked to the nucleic acid encoding the metastatic colorectal 
cancer protein. The term "control sequences" refers to DNA sequences used for the 
expression of an operably linked coding sequence in a particular host organism. Control 
sequences that are suitable for prokaryotes, e.g., include a promoter, optionally an operator 
sequence, and a ribosome binding site. Eukaryotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 
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Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
nucleic acid will generally be appropriate to the host cell used to express the metastatic 
colorectal cancer protein. Numerous types of appropriate expression vectors, and suitable f 
regulatory sequences are known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, 
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 
start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention 

In addition, an expression vector may comprise additional elements. For 
example, the expression vector may have two replication systems, thus allowing it to be 
maintained in twn organisms, eg., in mammalian «rmsect cells for expression and in a 
procaryotic host for cloning and amplification. Furthermore, for integrating expression 
vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct 
The integrating vector may be directed to a specif 

appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). 
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In addition, in a preferred embodiment, the expression vector contains a 
selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used 

The metastatic colorectal cancer proteins of the present invention are produced 
by culturing a host cell transformed with an expression vector containing nucleic acid 
encoding a metastatic colorectal cancer protein, under the appropriate conditions to induce or 
cause expression of the metastatic colorectal cancer protein. Conditions appropriate for 
metastatic colorectal cancer protein expression will vary with the choice of the expression 
vector and the ho st cell, and will be easily ascertained by one skilled in the art through 
routine experimentation or optimization. For example, the use of constitutive promoters in 
the expression vector will require optimizing the growth and proliferation of the host cell, 
while the use of an inducible promoter requires the appropriate growth conditions for 
induction. In addition, in some embodiments, the timing of the harvest is important For ~ 
example, the baculoviral systems used in insect cell expression are lytic viruses, and thus 
harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, K cott, Bacillus subtUis, Sf9 cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the metastatic colorectal cancer proteins are 
expressed in mammalian cells. Mammalian expression systems are also known in the art, and 
include retroviral and adenoviral systems. Of particular use as mammalian promoters are the 
promoters from mammalian viral genes, since the viral genes are often highly expressed and 
have a broad host range. Examples include the S V40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, hopes simplex virus promoter, and the 
CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription termination 
and polyadenylation sequences recognized by mammalian cells are regulatory regions located 
3* to the translation stop codon and thus, together with the promoter elements, flank the 
coding sequence. Examples of transcription terminator and polyadenylation signals include 
those derived form SV40. 

Hie methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 
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polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 
encapsulation of the polynucleotide^) in liposomes, and direct microinjection of the DNA 
into nuclei. 

In a preferred embodiment, metastatic colorectal cancer proteins are expressed 
in bacterial systems. Promoters from bacteriophage may also be used and are known in the 
art In addition, synthetic promoters and hybrid promoters are also useful; e.g., the tac 
promoter is a hybrid of the top and lac promoter sequences. Furthermore, a bacterial 
promoter can include naturally occurring promoters of non-bacterial origin that have the 
ability to bind bacterial RNA polymerase and initiate transcription, in addition to a 
functioning promote: sequence, an efficient ribosome binding site is desirable. The 
expression vector may also include a signal peptide sequence that provides for secretion of 
the metastatic colorectal cancer protein in bacteria. The protein is either secreted into the 
growth media (gram-positive bacteria) or into the periplasmic space, located between the _ 
inner and outer membrane of the cell (gram-negative bacteria). The bacterial expression " 
vector may also include a selectable marker gene to allow for the selection of bacterial strains 
that have been transformed. Suitable selection genes include genes which render the bacteria 
resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin, neomycin 
and tetracycline. Selectable markers also include biosynthetic genes, such as those in the 
histidine, tryptophan and leucine biosynthetic pathways. These components are assembled 
into expression vectors. Expression vectors for bacteria are well known in the art, and 
include vectors for Bacillus subtilis, E. coli, Streptococcus? cremoris, and Streptococcus 
lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial expression vectors 
are transformed into bacterial host cells using techniques well known in the art, such as 
calcium chloride treatment, electroporation, and others. 

In one embodiment, metastatic colorectal cancer proteins are produced in 
insect cells. Expression vectors for the transformation of insect cells, and in particular, 
baculovirus-based expression vectors, are well known in the art. 

In a preferred embodiment, metastatic colorectal cancer protein is produced in 
yeast cells. Yeast expression systems are well known in the art, and include expression 
vectors for Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula 
polymorphs Kluyveromycesfragilis andK lactis t Pichia guillerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarrowia lipolytica. 

The metastatic colorectal cancer protein may also be made as a fusion protein, 
using techniques well known in the art Thus, e.g., for the creation of monoclonal antibodies, 
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if the desired epitope is small, the metastatic colorectal cancer protein may be fused to a 
carrier protein to form an nnmunogen. Alternatively, the metastatic colorectal cancer protein 
may be made as a fusion protein to increase expression for affinity purification purposes, or 
for other reasons. For example, when the metastatic colorectal cancer protein is a metastatic 
colorectal cancer peptide, the nucleic acid encoding the peptide may be linked to other 
nucleic acid for expression purposes. 

In a preferred embodiment, the metastatic colorectal cancer protein is purified 
or isolated after expression. Metastatic colorectal cancer proteins may be isolated or purified 
in a variety of appropriate ways. Standard purification methods include electrophoretic, 
molecular, immunological and chromatographic techniques, including ion exchange, 
hydrophobic, affinity, and reverse-phase HPLC chromatography, and chromatofocusing. For 
example, the metastatic colorectal cancer protein may be purified using a standard and- 
metastatic colorectal cancer protein antibody column. Ultrafiltration and diafiltration - 
techniques, in conjunction with protein concentration, are also useful. For general guidance 
in suitable purification techniques, see Scopes, Protein Purification (1982). The degree of 
purification necessary will vary depending on the use of the metastatic colorectal cancer 
protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the metastatic colorectal cancer 
proteins and nucleic acids are useful in a number of applications. They may be used as 
immunoselection reagents, as vaccine reagents, as screening agents, etc. 

Variants of metastatic colorectal cancer proteins 

In one embodiment, the metastatic colorectal cancer proteins are derivative or 
variant metastatic colorectal cancer proteins as compared to the wild-type sequence. That is, 
as outlined more fully below, the derivative metastatic colorectal cancer peptide will often" 
contain at least one amino acid substitution, deletion or insertion, with amino acid 
substitutions being particularly preferred. The amino acid substitution, insertion or deletion 
may occur at a particular residue within the metastatic colorectal cancer peptide. 

Also included within one embodiment of metastatic colorectal cancer proteins 
of the present invention are amino acid sequence variants. These variants typically fall into 
one or more of three classes: substitutional, insertional or deletional variants. These variants 
ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA encoding the 
metastatic colorectal cancer protein, using cassette or PCR mutagenesis or other techniques, 
to produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
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culture as outlined above. However, variant metastatic colorectal cancer protein fragments 
having up to about 100-150 residues may be prepared by in vitro synthesis. Amino acid 
sequence variants are characterized by the predetermined nature of the variation, a feature 
that sets them apart from naturally occuning allelic or interspecies variation of the metastatic 
colorectal cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is 
often predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed metastatic colorectal cancer 
variants screened for the optimal combination of desired activity. Techniques exist for 
making substitution mutations at predetermined sites in DNA having a known sequence, elg., 
M13 primer mutagenesis and PCR mutagenesis. Screening of the mutants is done using 
assays of metastatic colorectal cancer protein activities. 

Amino acid substitutions are typically of single residues; insertions usually 
will be on the order of from about 1 to 20 amino acids, although considerably larger 
insertions may be occasionally tolerated Deletions range from about 1 to about 20 residues, 
although in some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 
arrive at a final derivative. Generally these changes are done on a few amino acids to 
minimize the alteration of the molecule. Larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of a metastatic colorectal cancer 
protein are desired, substitutions are generally made in accordance with the amino acid 
substitution chart provided in the definition section. 

Variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 
selected to modify the characteristics of the metastatic colorectal cancer proteins as needed. 
Alternatively, the variant may be designed or reorganized such that the biological activity of 
the metastatic colorectal cancer protein is altered. For example, glycosylation sites may be 
altered or removed 

Covalent modifications of metastatic colorectal cancer polypeptides are 
included within the scope of this invention. One type of covalent modification includes 
reacting targeted amino acid residues of a metastatic colorectal cancer polypeptide with an 
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organic derivatizing agent that is capable of reacting with selected side chains or the N-or C- 
tenninal residues of a metastatic colorectal cancer polypeptide. Derivatization with 
bifunctional agents is useful, fin: instance, for crosslinking metastatic colorectal cancer 
polypeptides to a water-insoluble support matrix or surface for use in the method for 
purifying anti-metastatic colorectal cancer polypeptide antibodies or screening assays, as is 
more fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl>2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters 
with 4-azidosalicylic acid, homobifiinctional imidoesters, including disuccinimidyl esters 
such as 33 '^thiobis(succiiirmidylpropionate), bifunctional maleimides such as bis-N- 
maleimido-l,8-octane and agents such as metiiyl-3-((p-azidophenyl)dithio)piopioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, aiginine, and histidine side chains (Creigjiton, 
Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of the N- 
terminal amine, and amidation of any C-tenninal carboxyl group. 

Another type of covalent modification of the metastatic colorectal cancer 
polypeptide encompassed by this invention is an altered native glycosylation pattern of the 
polypeptide. "Altering the native glycosylation pattern" is intended herein to mean adding to 
or deleting one or more carbohydrate moieties of a native sequence metastatic colorectal 
cancer polypeptide. Glycosylation patterns can be altered in many ways. For example the 
use of different cell types to express metastatic colorectal cancer-associated sequences can 
result in different glycosylation patterns. 

Addition of glycosylation sites to metastatic colorectal cancer polypeptides 
may also be accomplished by altering the amino acid sequence thereof. The alteration may 
be made, e.g., by the addition o£ or substitution by, one or more serine or threonine residues 
to the native sequence metastatic colorectal cancer polypeptide (for O-linked glycosylation 
sites). The metastatic colorectal cancer amino acid sequence may optionally be altered 
through changes at the DNA level, particularly by mutating the DNA encoding the metastatic 
colorectal cancer polypeptide at preselected bases such that codons are generated that will 
translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the 
metastatic colorectal cancer polypeptide is by chemical or enzymatic coupling of glycosides 
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to the polypeptide. Such methods are described in the art, e.g., in WO 87/05330, and in 
Aplin & Wriston, CRC CriL Rev. Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the metastatic colorectal cancer 
polypeptide may be accomplished chemically or enzymatically or by mutational substitution 
of codons encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al, Anal. Biochem., i 18:131 
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al. 9 Meth. 
EwymoU 138:350 (1987). 

Another type of covalent modification of metastatic colorectal cancer 
comprises linking the metastatic colorectal cancer polypeptide to one of a variety of 
nonproteinaceous polymers, e.g., polyethylene glycol, polypropylene glycol, or ~~- 
polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 
4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Metastatic colorectal cancer polypeptides of the present invention may also be 
modified in a way to form chimeric molecules comprising a metastatic colorectal cancer 
polypeptide fused to another, heterologous polypeptide or amino acid sequence. In one 
embodiment, such a chimeric molecule comprises a fusion of a metastatic colorectal cancer 
polypeptide with a tag polypeptide which provides an epitope to which an anti-tag antibody 
can selectively bind. The epitope tag is generally placed at the amino-or carboxyl-terminus 
of the metastatic colorectal cancer polypeptide. The presence of such epitope-tagged forms 
of a metastatic colorectal cancer polypeptide can be detected using an antibody against the 
tag polypeptide. Also, provision of the epitope tag enables the metastatic colorectal cancer 
polypeptide to be readily purified by affinity purification using an anti-tag antibody or - 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a metastatic colorectal cancer polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of 
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known and 
examples include poly-histidine (poly-his) or poly-histidme-glycine (poly-his-gly) tags; HIS6 
and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et al., 
MoL Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan et aL, Molecular and Cellular Biology 5:3610-3616 (1985)); 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et aL, 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-^eptide 
(Hopp et aL, BioTechnoIogy 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et aL, 
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et aL, J. Biol Chenu 
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyemiuth et aL, 
Proa NatL Acad. ScL USA 87:6393-6397 (1990)). 

Also included are other metastatic colorectal cancer proteins of the metastatic 
colorectal cancer family, and metastatic colorectal cancer proteins from other organisms, 
which are cloned and expressed as outlined below. Thus, probe or degenerate polymerase 
chain reaction (PCR) primer sequences may be used to find other related metastatic colorectal 
cancer proteins from primates or other organisms. As will be appreciated by those in the art, 
particularly useful probe and/or PCR primer sequences include unique areas of the metastatic 
colorectal cancer nucleic acid sequence. As is generally known in the art, preferred PCR - 
primers are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 
being preferred, and may contain inosine as needed PCR reaction conditions are well known 
in the art (e.g., hmis, PCR Protocols, supra). 

Antibodies to metastatic colorectal cancer proteins 

In a preferred embodiment, when a metastatic colorectal cancer protein is to 
be used to generate antibodies, e.g., for immunotherapy or immunodiagnosis, the metastatic 
colorectal cancer protein should share at least one epitope or determinant with the full length 
protein. By "epitope" or "determinant* * herein is typically meant a portion of a protein which 
will generate and/or bind an antibody or T-cell receptor in the contract ofMHC. Thus, in 
most instances, antibodies made to a smaller metastatic colorectal cancer protein will be able 
to bind to the full-length protein, particularly linear epitopes. In a preferred embodiment, 4he 
epitope is unique; that is, antibodies generated to a unique epitope show little or no cross- 
reactivity. 

Methods of preparing polyclonal antibodies are well known (e.g., Coligan, 
supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., 
by one or more injections of an immunizing agent and, if desired, an adjuvant Typically, the 
immunizing agent and/or adjuvant will be injected in the ma mmal by multiple subcutaneous 
or intraperitoneal injections. The immunizing agent may include a protein encoded by a 
nucleic acid of Tables 1 -26 or fragment thereof or a fusion protein thereof It may be usefiil 
to conjugate the immunizing agent to a protein known to be immunogenic in the mammal 
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being immunized Immunogenic proteins include, e.g., keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Adjuvants include, e.g., 
Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoiyl Lipid A, synthetic 
trehalose dicoiynomycolate). The immunization protocol may be selected by one skilled in 
the art 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
antibodies may be prepared using hybridoma methods, such as those described by Kohler & 
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the immunizing agent Alternatively, the lymphocytes may be immunized in vitro. The 
immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
26, or fragment thereof or a fusion protein thereof. Generally, either peripheral blood - 
lymphocytes fTBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice, 
pp. 59-103 (1986)). Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and primate origin. Usually, rat or mouse 
myeloma cell lines are employed The hybridoma cells may be cultured in a suitable culture 
medium that preferably contains one or more substances that inhibit the growth or survival of 
the unfiised, immortalized cells. For example, if the parental cells lack die enzyme 
hypoxanfhine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific 
antibodies are typically monoclonal, preferably human or humanized, antibodies that have 
binding specificities for at least two different antigens or that have binding specificities for 
two epitopes on the same antigen. In one embodiment, one of the binding specificities is for 
a protein encoded by a nucleic acid of Tables 1-26 or a fragment thereof the other one is for 
any other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 
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In a preferred embodiment, the antibodies to metastatic colorectal cancer 
protein are capable of reducing or eliminating a biological function of a metastatic colorectal 
cancer protein, as is described below. That is, the addition of anti-metastatic colorectal 
cancer protein antibodies (either polyclonal or preferably monoclonal) to metastatic 
colorectal cancer tissue (or cells containing metastatic colorectal cancer) may reduce or 
eliminate the metastatic colorectal cancer. Generally, at least a 25% decrease in activity, 
growth, size or the like is preferred, with at least about 50% being particularly preferred and 
about a 95-100% decrease being especially preferred 

hi a preferred embodiment the antibodies to the metastatic colorectal cancer 
proteins are humanized antibodies (e.g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., 
Protein Design Labs, Inc.) Humanized forms of non-human (e.g., murine) antibodies are 
chimeric molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such 
as Fv, Fab, Fab*, F(ab , )2 or other antigen-binding subsequences of antibodies) which contain 
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues from a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 
af fini ty and capacity. In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
imm unoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et aL, Nature 321:522-525 (1986); Riechmarm et aL 9 Nature 
332:323-329 (1988); and Presta, Curr. Op. Struct Biol 2:593-596 (1992)). Humanization 
can be essentially performed following the method of Winter and co-workers (Jones et al., 
Nature 321:522-525 (1986); Riechmann et aL, Nature 332:323-327 (1988); Verhoeyen et aL, 
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
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human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

Human-like antibodies can also be produced using various techniques known 
in the art, including phage display libraries (Hoogenboom & Winter, J. Mol Biol 227:381 
(1991); Marks et al, /. Mol Biol 222:581 (1991)). The techniques of Cole et al and 
Boerner et al are also available for the preparation of human monoclonal antibodies (Cole et 
al, Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al, J. Immunol 
147(l):86-95 (1991)). Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, 
human antibody production is observed, which closely resembles that seen in humans in 
virtually all respects, including gene rearrangement, assembly, and antibody repertoire. This 
approach is described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in the following scientific publications: Marks et al, 
Bio/Technology 10:779-783 (1992); Lonbeig et al., Nature 368:856-859 (1994); Morrison, 
Nature 368:812-13 (1994); Fishwildef o£, Nature Biotechnology 14:845-51 (1996); 
Neuberger, Nature Biotechnology 14:826 (1996); Lonberg & Huszar, Intern. Rev. Immunol 
13:65-93 (1995). 

By immunotherapy is meant treatment of metastatic colorectal cancer with an 
antibody raised against a metastatic colorectal cancer proteins. As used herein, 
immunotherapy can be passive or active. Passive immunotherapy as defined herein is the 
passive transfer of antibody to a recipient (patient). Active immunization is the induction of 
antibody and/or T-cell responses in a recipient (patient). Induction of an immune response is 
the result of providing the recipient with an antigen to which antibodies are raised. The 
antigen may be provided by injecting a polypeptide against which antibodies are desired te be 
raised into a recipient, or contacting the recipient with a nucleic acid capable of expressing 
the antigen and under conditions for expression of the antigen, leading to an immune 
response. 

In a preferred embodiment the metastatic colorectal cancer proteins against 
which antibodies are raised are secreted proteins as described above. Without being bound 
by theory, antibodies used for treatment, bind and prevent the secreted protein from binding 
to its receptor, thereby inactivating the secreted metastatic colorectal cancer protein. 

In another preferred embodiment, the metastatic colorectal cancer protein to 
which antibodies are raised is a transmembrane protein. Without being bound by theory, 
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antibodies used for this treatment typically bind the extracellular domain of the metastatic 
colorectal cancer protein and prevent it from binding to other proteins, such as circulating 
Kgands or cell-associated molecules. The antibody may cause down-regulation of the 
transmembrane metastatic colorectal cancer protein. The antibody may be a competitive, 
non-competitive or uncompetitive inhibitor of protein binding to the extracellular domain of 
the metastatic colorectal cancer protein. The antibody may be an antagonist of the metastatic 
colorectal cancer protein or may prevent activation of the trans mem b r a n e metastatic 
colorectal cancer protein. In some embodiments, when the antibody prevents the binding of 
other molecules to the metastatic colorectal cancer protein, the antibody prevents growth of 
thecelL The antibody may also be used to target or sensitize the cell to cytotoxic agents, 
including, but not limited to TNF-ct, TNF-p, IL-1, INF-y and IL-2, or chemotherapeutic 
agents including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate, and the like. In 
some instances the antibody belongs to a sub-type that activates serum complement when ~ 
complexed with the transmembrane protein thereby mediating cytotoxicity or antigen- 
dependent cytotoxicity (ADCQ. Thus, metastatic colorectal cancer is treated by 
administering to a patient antibodies directed against the transmembrane metastatic colorectal 
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 
otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector 
moiety. The effector moiety can be any number of molecules, including labeling moieties 
such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect 
the therapeutic moiety is a small molecule that modulates the activity of the metastatic 
colorectal cancer protein. In another aspect the therapeutic moiety modulates the activity of 
molecules associated with or in close proximity to the metastatic colorectal cancer protein. 
The therapeutic moiety may inhibit enzymatic activity such as protease or collagenase 
activity associated with metastatic colorectal cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent In this method, targeting the cytotoxic agent to metastatic colorectal cancer tissue or 
cells results in a reduction in the number of afflicted cells, thereby reducing symptoms 
associated with metastatic colorectal cancer. Cytotoxic agents are numerous and varied and 
include, but are not limited to, cytotoxic drugs or toxins or active fragments of such toxins. 
Suitable toxins and their corresponding fragments include diphtheria A chain, exotoxin A 
chain, ricin A chain, abrin A chain, curcin, cretin, phenomycin, enomycin and the like. 
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Cytotoxic agents also include radiochemicals made by conjugating radioisotopes to 
antibodies raised against metastatic colorectal cancer proteins, or binding of a radionuclide to 
a chelating agent that has been covalently attached to the antibody. Targeting the therapeutic 
moiety to transmembrane metastatic colorectal cancer proteins not only serves to increase the 
local concentration of therapeutic moiety in the metastatic colorectal cancer afflicted area, but 
also serves to reduce deleterious side effects that may be associated with the therapeutic 
moiety. 

In another preferred embodiment, the metastatic colorectal cancer protein 
against which the antibodies are raised is an intracellular protein. In this case, the antibody 
may be conjugated to a protein or other entity which facilitates entry into the cell In one 
case, the antibody enters die cell by endocytosis. In another embodiment, a nucleic acid 
encoding the antibody is administered to the individual or cell. Moreover, wherein the 
metastatic colorectal cancer protein can be targeted within a cell, i.e., the nucleus, an - 
antibody thereto contains a signal for that target localization, i.e., a nuclear localization 
signal. 

The metastatic colorectal cancer antibodies of the invention specifically bind 
to metastatic colorectal cancer proteins. By "specifically bind" herein is meant that the 
antibodies bind to the protein with a Kd of at least about 0. 1 mM, more usually at least about 
1 jiM, preferably at least about 0.1 nM or better, and most preferably, 0.01 nM or better. 
Selectivity of binding is also important 

Detection of metastatic colorectal cancer sequence for diagnostic and therapeutic 
applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the metastatic colorectal cancer phenotype. Expression levels of genes inr 
normal tissue (i.e., not undergoing metastatic colorectal cancer) and in metastatic colorectal 
cancer tissue (and in some cases, for varying severities of metastatic colorectal cancer that 
relate to prognosis, as outlined below) are evaluated to provide expression profiles. An 
expression profile of a particular cell state or point of development is essentially a 
"fingerprint" of the state. While two states may have any particular gene similarly expressed, 
the evaluation of a number of genes simultaneously allows the generation of a gene 
expression profile that is reflective of the state of the cell. By comparing expression profiles 
of cells in different states, information regarding which genes are important (including both 
up- and down-regulation of genes) in each of these states is obtained. Then, diagnosis may 
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be performed or confirmed to determine whether a tissue sample has the gene expression 
profile of normal or cancerous tissue. This will provide for molecular diagnosis of related 
conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus metastatic colorectal cancer tissue. Genes may be turned on or turned off in a 
particular state, relative to another state thus permitting comparison of two or more states. A 
qualitatively regulated gene will exhibit an expression pattern within a state or cell type 
which is detectable by standard techniques. Some genes will be expressed in one state or cell 
type, but not in both. Alternatively, the difference in expression may be quantitative, e.g., in 
that expression is increased or decreased; i.e., gene expression is either upregulated, resulting 
in an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Asymetrix 
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PGR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is typically 
at least about 50%, more preferably at least about 100%, more preferably at least about 
150%, more preferably at least about 200%, with from 300 to at least 1000% being especially 
preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of die gene transcript, and the quantification of gene expression levels, or, alternatively, the 
final gene product itself (protein) can be monitored, e.g., with antibodies to the metastatic 
colorectal cancer protein and standard immunoassays (ELISAs, etc.) or other techniques, 
including mass spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins 
corresponding to metastatic colorectal cancer genes, i.e., those identified as being important 
in a metastatic colorectal cancer phenotype, can be evaluated in a metastatic colorectal cancer 
diagnostic test 

In a preferred embodiment, gene expression monitoring is performed 
simultaneously on a number of genes. 
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The metastatic colorectal cancer nucleic acid probes may be attached to 
biochips as outlined herein for the detection and quantification of metastatic colorectal cancer 
sequences in a particular cell. The assays are further described below in the example. PCR 
techniques can be used to provide greater sensitivity. Multiple protein expression monitoring 
can be performed as well. Similarly, these assays may be performed on an individual basis as 
well. 

In a preferred embodiment nucleic acids encoding the metastatic colorectal 
cancer protein are detected. Although DNA or RNA encoding the metastatic colorectal 
cancer protein may be detected, of particular interest are methods wherein an mRNA 
encoding a metastatic colorectal cancer protein is detected. Probes to detect mRNA can be a 
nucleotide/deoxynucleotide probe that is complementary to and hybridizes with the mRNA 
and includes, but is not limited to, oligonucleotides, cDNA or RNA. Probes also should 
contain a detectable label, as defined herein. In one method the mRNA is detected after ~ 
immobilizing the nucleic acid to be examined on a solid support such as nylon membranes 
and hybridizing the probe with the sample. Following washing to remove the non- 
specifically bound probe, the label is detected. In another method detection of the mRNA is 
performed in situ. In this method permeabilized cells or tissue samples are contacted with a 
detectably labeled nucleic acid probe for sufficient time to allow the probe to hybridize with 
the target mRNA. Following washing to remove the non-specifically bound probe, the label 
is detected For example a digoxygenin labeled riboprobe (RNA probe) that is 
complementary to the mRNA encoding a metastatic colorectal cancer protein is detected by 
binding the digoxygenin with an anti-digoxygenin secondary antibody and developed with 
nitro blue tetrazolium and 5-bromo-4-chloro-3-indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins 
as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The metastatic colorectal cancer proteins, antibodies, nucleic acids, modified proteins 
and cells containing metastatic colorectal cancer sequences are used in diagnostic assays. 
This can be performed on an individual gene or corresponding polypeptide level In a 
preferred embodiment, the expression profiles are used, preferably in conjunction with high 
throughput screening techniques to allow monitoring for expression profile genes and/or 
corresponding polypeptides. 

As described and defined herein, metastatic colorectal cancer proteins, 
including intracellular, transmembrane or secreted proteins, find use as markers of metastatic 
colorectal cancer. Detection of these proteins in putative metastatic colorectal cancer tissue 

( 
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allows for detection or diagnosis of metastatic colorectal cancer. In one embodiment, 
antibodies are used to detect metastatic colorectal cancer proteins, A preferred method 
separates proteins from a sample by electrophoresis on a gel (typically a denaturing and 
reducing protein gel, but may be another type of gel, including isoelectric focusing gels and 
the like). Following separation of proteins, the metastatic colorectal cancer protein is 
detected, e.g., by immimoblotting with antibodies raised against the metastatic colorectal 
cancer protein. Methods of immimoblotting are well known to those of ordinary skill in the 
art 

In another preferred method, antibodies to the metastatic colorectal cancer 
protein find use in in situ imaging techniques, e.g., in histology (e.g., Methods in Cell 
Biology: Antibodies in Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are 
contacted with, from one to many antibodies to the metastatic colorectal cancer protein(s). 
Following washing to remove non-specific antibody binding, the presence of the antibody^ 
antibodies is detected. In one embodiment the antibody is detected by incubating with a 
secondary antibody that contains a detectable label, e.g., multicolor fluorescence or confocal 
imaging. In another method the primary antibody to the metastatic colorectal cancer 
protein(s) contains a detectable label, e.g., an enzyme marker that can act on a substrate. In 
another preferred embodiment each one of multiple primary antibodies contains a distinct 
and detectable label. This method finds particular use in simultaneous screening for a 
plurality of metastatic colorectal cancer proteins. Many other histological imaging 
techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

M' another preferred embodiment, antibodies find use in diagnosing metastatic 
colorectal cancer from blood, serum, plasma, stool, and other samples. Such samples, 
therefore, are useful as samples to be probed or tested for the presence of metastatic 
colorectal cancer proteins. Antibodies can be used to detect a metastatic colorectal cancer 
protein by previously described immunoassay techniques including ELBA, immimoblotting 
(western blotting), immunoprecipitation, BIACORE technology and the like. Conversely, the 
presence of antibodies may indicate an immune response against an endogenous metastatic 
colorectal cancer protein or vaccine. 

In a preferred embodiment, in situ hybridization of labeled metastatic 
colorectal cancer nucleic acid probes to tissue arrays is done. For example, arrays of tissue 
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samples, including metastatic colorectal cancer tissue and/or normal tissue, are made. In situ 
hybridization (see, e.g., Ausubel, supra) is then performed When comparing the fingerprints 
between an individual and a standard, the skilled artisan can make a diagnosis, a prognosis, or 
a prediction based on the findings. It is further understood that the genes which indicate the 
diagnosis may differ from those which indicate the prognosis and molecular profiling of the 
condition of the cells may lead to distinctions between responsive or refractory conditions or 
may be predictive of outcomes. 

In a preferred embodiment, the metastatic colorectal cancer proteins, 
antibodies, nucleic acids, modified proteins and cells containing metastatic colorectal cancer 
sequences are used in prognosis assays. As above, gene expression profiles can be generated 
that correlate to metastatic colorectal cancer, in terms of long term prognosis. Again, this 
may be done on either a protein or gene level, with the use of genes being preferred. As 
above, metastatic colorectal cancer probes may be attached to biochips for the detection ami 
quantification of metastatic colorectal cancer sequences in a tissue or patient The assays 
proceed as outlined above for diagnosis. PCR method may provide more sensitive and 
accurate quantification. 

Assays for therapeutic compounds 

In a preferred embodiment members of the three classes of proteins as 
described herein are used in drug screening assays. The metastatic colorectal cancer proteins, 
antibodies, nucleic acids, modified proteins and cells containing metastatic colorectal cancer 
sequences are used in drug screening assays or by evaluating the effect of drug candidates on 
a "gene expression profile" or expression profile of polypeptides. In a preferred embodiment, 
the expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
agent (e.g., Zlokarnik, et al 9 Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the metastatic colorectal cancer proteins, 
antibodies, nucleic acids, modified proteins and cells containing the native or modified 
metastatic colorectal cancer proteins are used in screening assays. That is, the present 
invention provides novel methods for screening for compositions which modulate the 
metastatic colorectal cancer phenotype or an identified physiological function of a metastatic 
colorectal cancer protein. As above, this can be done on an individual gene level or by 
evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
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screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, sitpra. 

Having identified the differentially expressed genes herein, a variety of assays 
may be applied In a preferred embodiment, assays maybe run on an individual gene or 
protein level. That is, having identified a particular gene with altered regulation in metastatic 
colorectal cancer, test compounds can be screened for the ability to modulate gene expression 
or for binding to the metastatic colorectal cancer protein. "Modulation" thus includes an 
increase or a decrease in gene expression. The preferred amount of modulation will depend 
on the original change of die gene expression in normal versus tissue undergoing metastatic 
colorectal cancer, with changes of at least 10%, preferably 50%, more preferably 100-300%, 
and in some embodiments 300-1000% or greater. Thus, if a gene exhibits a 4-fold increase in 
metastatic colorectal cancer tissue compared to normal tissue, a decrease of about four-fold is 
often desired; similarly, a 10-fold decrease in metastatic colorectal cancer tissue comparechto 
normal tissue often provides a target value of a 10-fold increase in expression to be induced 
by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the metastatic colorectal cancer protein 
and standard immunoassays. Proteomics and separation techniques may also allow 
quantification of expression. 

In a preferred embodiment, gene or protein expression monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein. 

In this embodiment, the metastatic colorectal cancer nucleic acid probes are 
attached to biochips as outlined herein for the detection and quantification of metastatic - 
colorectal cancer sequences in a particular cell Alternatively, PCR may be used Thus, a 
series, e.g., of microtiter plate, may be used with dispensed primers in desired wells. A PCR 
reaction can then be performed and analyzed for each well. 

Expression monitoring can be performed to identify compounds that modify 
the expression of one or more metastatic colorectal cancer-associated sequences, e.g., a 
polynucleotide sequence set out in Tables 1-26. Generally, in a preferred embodiment, a test 
compound is added to the cells prior to analysis. Moreover, screens are also provided to 
identify agents that modulate metastatic colorectal cancer, modulate metastatic colorectal 
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cancer proteins, bind to a metastatic colorectal cancer protein, or interfere with the binding of 
a metastatic colorectal cancer protein and an antibody, substrate, or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the metastatic colorectal cancer phenotype or the expression of a metastatic 
colorectal cancer sequence, e.g., a nucleic acid or protein sequence. In preferred 
embodiments, modulators alter expression profiles of nucleic acids or proteins provided 
herein. In one embodiment, the modulator suppresses a metastatic colorectal cancer 
phenotype, e.g., to a normal tissue fingerprint In another embodiment, a modulator induces a 
metastatic colorectal cancer phenotype. Generally, a plurality of assay mixtures are run in 
parallel with different agent concentrations to obtain a differential response to the various 
concentrations. Typically, one of these concentrations serves as a negative control, Le., at~ 
zero concentration or below the level of detection. 

In one aspect, a modulator will neutralize the effect of a metastatic colorectal 
cancer protein. By ''neutralize" is meant that activity of a protein and the consequent effect 
on the cell is inhibited or blocked. 

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a metastatic colorectal cancer polypeptide or to modulate 
activity. Conventionally, new chemical entities with useful properties are generated by 
identifying a chemical compound (called a "lead compound") with some desirable property 
or activity, e.g., inhibiting activity, creating variants of the lead compound, and evaluating the 
property and activity of those variant compounds. Often, high throughput screening (HTS) 
methods are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve ~ 
providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
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chemical building blocks called amino acids in every possible way for a given compound 
length (Le., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop etaL, J. Med Chem. 37(9): 1233-1251 (1994)). 

Preparation and screening of combinatorial chemical libraries is well known to 
those of skill in die art Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g, U.S. Patent No. 5,010,175, Furica, Pept Prot Res. 37:487-493 
(1991), Houghton etaL, Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 
Publication WO 92/00091), benzodiazepines (U.S. Pat No. 5,288,514), diversomers such as 
hydantoins, benzodiazepines and dipeptides (Hobbs et al.\ Proa Nat. Acad. ScL USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et aL, J. Amer. Chem. Soc 
1 14:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(ffirschmann et aL,J. Amer. Chem. Soc 114:9217-9218 (1992)), analogous organic syntheses 
of small compound libraries (Choi et aL, J. Amer. Chem. Soc. 1 16:2661 (1994)), 
oligocarbamates (Cho, et aL, Science 261:1303 (1993)), and/or peptidyl phosphonates 
(Campbell et al. s J. Org. Chem. 59:658 (1994)). See, generally, Gordon et aL, J. Med. Chem. 
37:1385 (1994), nucleic acid libraries (see, eg., Strategene, Corp.), peptide nucleic acid 
libraries (see, eg., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et aL, Nature 
Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries (see, 
eg., Iiang et al„ Science 274: 1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic molecule libraries (see, eg., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 
U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). - 

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433 A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopktaton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif), which mimic the manual 



57 



WO 02/068677 



PCIYUS02/06001 



synthetic operations performed by a chemist The above devices, with expropriate 
modification, are suitable for use with the present invention. In addition, numerous 
combinatorial libraries are themselves commercially available (see, e.g., ComGenex, 
Princeton, NJ., Asinex, Moscow, Ru, Tripos, Inc., St Louis, MO, ChemStar, Ltd, Moscow, 
RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, Columbia, MD, etc.)* 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect modulation of metastatic colorectal cancer gene transcription, 
polypeptide expression, and polypeptide activity. 

High throughput assays for evaluating the presence, absence, quantification, or 
other properties of particular nucleic acids or protein products are well known to those of 
skill in the art Similarly, binding assays and reporter gene assays are similarly well known. 
Thus, e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for 
proteins, U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic 
acid binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate procedures, including sample and reagent pipetting, liquid dispensing, 
timed incubations, and final readings of the microplate in detectors) appropriate for the 
assay. These configurable systems provide high throughput and rapid start up as well as a 
high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
preferred. Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g., substrates for enzymes or ligands and receptors. 
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In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased^-random peptides. By 
"randomized" or grammatical equivalents herein is meant that the nucleic acid or peptide 
consists of essentially random sequences of nucleotides and amino acids, respectively. Since 
these random peptides (or nucleic acids, discussed below) are often chemically synthesized, 
they may incorporate any nucleotide or amino acid at any position. The synthetic process can 
be designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is biased. ~ 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. In a preferred embodiment, the nucleotides or amino acid 
residues are randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic 
residues, sterically biased (either small or large) residues, towards the creation of nucleic acid 
binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 domains, 
serines, threonines, tyrosines or histidines for phosphorylation sites, etc. 

Modulators of metastatic colorectal cancer can also be nucleic acids, as 

defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
be naturally occurring nucleic acids, random nucleic acids, or "biased'* random nucleic acids. 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical- 
moieties, a wide variety of which are available in the literature. 

After a candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence is analyzed. If required, the 
target sequence is prepared using known techniques. For example, the sample may be treated 
to lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or 
amplification such as PCR performed as appropriate. For example, an in vitro transcription 
with labels covalently attached to the nucleotides is performed Generally, the nucleic acids 
are labeled with biotin-FTTC or PE, or with cy3 or cy5. 
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In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 
such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 
streptavidin is labeled as described above, thereby, providing a detectable signal for the 
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

Nucleic acid assays can be direct hybridization assays or can comprise 
"sandwich assays", which include the use of multiple probes, as is generally outlined in U.S. 
Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731,; 
5,571,670, 5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, 
all of which are hereby incorporated by reference. In this embodiment, in general, the target 
nucleic acid is prepared as outlined above, and then added to the biochip comprising a 
plurality of nucleic acid probes, under conditions that allow the formation of a hybridization 
complex. 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allow formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
altering a step parameter that is a thermodynamic variable, including, but not limited to, 
temperature, formamide concentration, salt concentration, chaotropic salt concentration, pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,68 1,697. Thus it may be desirable to perform certain 
steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc. 
which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
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assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc, may also be 
used as appropriate, depending on die sample preparation methods and purity of die target 

The assay data are analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the metastatic colorectal 
cancer phenotype. In one embodiment, screening is performed to identify modulators that 
can induce or suppress a particular expression profile, thus preferably generating die 
associated phenotype. In another embodiment, e.g., for diagnostic applications, having 
identified differentially expressed genes important in a particular state, screens can be 
performed to identify modulators that alter expression of individual genes. In an another 
embodiment, screening is performed to identify modulators that alter a biological function of 
die expression product of a differentially expressed gene. Again, having identified die 
importance of a gene in a particular state, screens are performed to identify agents that bind 
and/or modulate the biological activity of die gene product, or evaluate genetic 
polymorphisms. 

Genes can be screened for those that are induced in response to a candidate 
agent After identifying a modulator based upon its ability to suppress a metastatic colorectal 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
metastatic colorectal cancer gene expression profile so as to mimic the expression of the gene 
from normal tissue, a screen as described above can be performed to identify genes that are 
specifically modulated in response to the agent Comparing expression profiles between 
normal tissue and agent treated metastatic colorectal cancer tissue reveals genes that are not 
expressed in normal tissue or metastatic colorectal cancer tissue, but are expressed in agent 
treated tissue. These agent-specific sequences can be identified and used by methods 
described herein for metastatic colorectal cancer genes or proteins. In particular these 
sequences and the proteins they encode find use in marking or identifying agent treated cells. 
In addition, antibodies can be raised against the agent induced proteins and used to target 
novel therapeutics to die treated metastatic colorectal cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
metastatic colorectal cancer cells, that have an associated metastatic colorectal cancer 
expression profile. By "administration" or "contacting" herein is meant that the candidate 
agent is added to the cells in such a manner as to allow the agent to act upon die cell, whether 
by uptake and intracellular action, or by action at the cell surface. In some embodiments, 
nucleic acid encoding a proteinaceous candidate agent (Le., a peptide) may be put into a viral 
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construct such as an adenoviral or retroviral construct, and added to the cell, such that 
expression of the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene 
therapy systems can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, e.g., metastatic colorectal cancer tissue may be screened for agents that 
modulate, e.g., induce or suppress the metastatic colorectal cancer phenotype. A change in at 
least one gene, preferably many, of the expression profile indicates that the agent has an 
effect on metastatic colorectal cancer activity. By defining such a signature for the metastatic 
colorectal cancer phenotype, screens for new drugs that alter the phenotype can be devised 
With this approach, the drug target need not be known and need not be represented in the ~ 
original expression screening platform, nor does the level of transcript for the target protein 
need to change. 

Measure of metastatic colorectal cancer polypeptide activity, or of metastatic 
colorectal cancer or the metastatic colorectal cancer phenotype can be performed using a 
variety of assays. For example, the effects of the test compounds upon the function of the 
metastatic polypeptides can be measured by examining parameters described above. A 
suitable physiological change that affects activity can be used to assess the influence of a test 
compound on the polypeptides of this invention. When the functional consequences are 
determined using intact cells or animals, one can also measure a variety of effects such as, in 
the case of metastatic colorectal cancer associated with tumors, tumor growth, tumor 
metastasis, neovascularization, hormone release, transcriptional changes to both known and 
uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such ascell 
growth or pH changes, and changes in intracellular second messengers such as cGMP. In the 
assays of the invention, mammalian metastatic colorectal cancer polypeptide is typically 
used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitro. For example, a colorectal cancer polypeptide is first contacted with a potential 
modulator and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one 
embodiment, the metastatic colorectal cancer polypeptide levels are determined in vitro by 
measuring the level of protein or mRNA The level of protein is measured using 
immunoassays such as western blotting, ELISA and the like with an antibody that selectively 
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binds to the metastatic colorectal cancer polypeptide or a fragment thereof For measurement 
ofmRNA, amplification, e.g., using PCR, LCR, or hybridization assays, e.g., northern 
hybridization, RNAse protection, dot blotting, are preferred. The level of protein or mRNA 
is detected using directly or indirectly labeled detection agents, e.g., fluorescently or 
radioactively labeled nucleic acids, radioactively or enzymafically labeled antibodies, and the 
like, as described herein. 

Alternatively, a reporter gene system can be devised using the metastatic 
colorectal cancer protein promoter operably linked to a reporter gene such as luciferase, green 
fluorescent protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. 
After treatment with a potential modulator, the amount of reporter gene transcription, 
translation, or activity is measured according to standard techniques known to those of skill 
in the art 

In a preferred embodiment, as outlined above, screens may be done on ~ 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of the 
expression of die gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "metastatic colorectal 
cancer proteins." The metastatic colorectal cancer protein may be a fragment, or 
alternatively, be the full length protein to a fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated, in another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the metastatic colorectal cancer proteins can be used in Che 
assays. 

Thus, in a preferred embodiment, the methods comprise combining a 
metastatic colorectal cancer protein and a candidate compound, and determining the binding 
of the compound to the metastatic colorectal cancer protein. Preferred embodiments utilize 
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the human metastatic colorectal cancer protein, although other mammalian proteins may also 
be used, e.g., for the development of animal models of human disease. In some embodiments, 
as outlined herein, variant or derivative metastatic colorectal cancer proteins may be used 

Generally, in a preferred embodiment of the methods herein, the metastatic 
colorectal cancer protein or the candidate agent is non-diffiisably bound to an insoluble 
support having isolated sample receiving areas (e.g., a microliter plate, an array, etc.). The 
insoluble supports may be made of any composition to which the compositions can be bound, 
is readily separated from soluble material, and is otherwise compatible with the overall 
method of screening. The surface of such supports may be solid or porous and of any 
convenient shape. Examples of suitable insoluble supports include microliter plates, arrays, 
membranes and beads. These are typically made of glass, plastic (e.g., polystyrene), 
polysaccharides, nylon or nitrocellulose, teflon™, etc. Microtiter plates and arrays are 
especially convenient because a large number of assays can be carried out simultaneously," - 
using small amounts of reagents and samples. The particular manner of binding of the 
composition is not crucial so long as it is compatible with the reagents and overall methods of 
the invention, maint ains the activity of the composition and is nondiffusable. Preferred 
methods of binding include the use of antibodies (which do not sterically block either the 
ligand binding site or activation sequence when the protein is bound to the support), direct 
binding to "sticky" or ionic supports, chemical crosslinking, the synthesis of the protein or 
agent on the surface, etc. Following binding of the protein or agent, excess unbound material 
is removed by washing. The sample receiving areas may then be blocked through incubation 
with bovine serum albumin (BSA), casein or other innocuous protein or other moiety. 

In a preferred embodiment, the metastatic colorectal cancer protein is bound to 
the support, and a test compound is added to the assay. Alternatively, the candidate agent is 
bound to the support and the metastatic colorectal cancer protein is added. Novel binding " 
agents include specific antibodies, non-natural binding agents identified in screens of 
chemical libraries, peptide analogs, etc. Of particular interest are screening assays for agents 
that have a low toxicity for human cells. A wide variety of assays may be used for this 
purpose, including labeled in vitro protein-protein binding assays, electrophoretic mobility 
shift assays, immunoassays for protein binding, functional assays (phosphorylation assays, 
etc.) and the like. 

The determination of the binding of the test modulating compound to the 
metastatic colorectal cancer protein may be done in a number of ways. In a preferred 
embodiment, the compound is labeled, and binding determined directly, e.g., by attaching all 
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or a portion of the metastatic colorectal cancer protein to a solid support, adding a labeled 
candidate, agent (e.g., a fluorescent label), washing off excess reagent, and determining 
whether the label is present on the solid support Various blocking and washing steps may be 
utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a metastatic colorectal cancer protein), such as an antibody, peptide, binding 
partner, ligand, etc. Under certain circumstances, there may be competitive binding between 
the compound and the binding moiety, with the binding moiety displacing the compound In 
one embodiment, the test compound is labeled Either the compound, or the competitor, or 
both, is added first to the protein for a time sufficient to allow binding, if present 
Incubations may be performed at a temperature which facilitates optimal activity, typically 
between 4 and 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high 
throughput screening. Typically between 0. 1 and 1 hour will be sufficient Excess reagent is 
generally removed or washed away. The second component is then added, and the presence 
or absence of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound Displacement of the competitor is an indication that the test compound is binding 
to the metastatic colorectal cancer protein and thus is capable of binding to, and potentially 
modulating, the activity of the metastatic colorectal cancer protein. In this embodiment, -~ 
either component can be labeled. Thus, e.g., if the competitor is labeled, the presence of label 
in die wash solution indicates displacement by the agent Alternatively, if the test compound 
is labeled, the presence of the label on the support indicates displacement 

In an alternative embodiment, the test compound is added first, with 
incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the metastatic colorectal cancer 
protein with a higher affinity. Thus, if the test compound is labeled, the presence of the label 
on the support, coupled with a lack of competitor binding, may indicate that the test 
compound is capable of binding to the metastatic colorectal cancer protein. 
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In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating die activity of the metastatic colorectal cancer 
proteins. In this embodiment, the methods comprise combining a metastatic colorectal cancer 
protein and a competitor in a first sample. A second sample comprises a test compound, a 
metastatic colorectal cancer protein, and a competitor. The binding of the competitor is 
determined for both samples, and a change, or difference in binding between the two samples 
indicates the presence of an agent capable of binding to the metastatic colorectal cancer 
protein and potentially modulating its activity. That is, if the binding of the competitor is 
different in the second sample relative to the first sample, the agent is capable of binding to 
the metastatic colorectal cancer protein. 

Alternatively, differential screening is used to identify drug candidates that 
bind to the native metastatic colorectal cancer protein, but cannot bind to modified metastatic 
colorectal cancer proteins. The structure of the metastatic colorectal cancer protein may be^ 
modeled, and used in rational drug design to synthesize agents that interact with that site. 
Drug candidates that affect the activity of a metastatic colorectal cancer protein are also 
identified by screening drugs for the ability to either enhance or reduce the activity of the 
protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

A variety of other reagents may be included in the screening assays. These" 
include reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a metastatic colorectal cancer protein. The 
methods comprise adding a test compound, as defined above, to a cell comprising metastatic 
colorectal cancer proteins. Preferred cell types include almost any cell. The cells contain a 
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recombinant nucleic acid that encodes a metastatic colorectal cancer protein. In a preferred 
embodiment, a library of candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate metastatic colorectal cancer agents are 
identified. Compounds with pharmacological activity are able to enhance or interfere with 
the activity of the metastatic colorectal cancer protein. Once identified, similar structures are 
evaluated to identify critical structural feature of the compound. 

In one embodiment, a method of inhibiting metastatic colorectal cancer cell 
division is provided. The method comprises administration of a metastatic colorectal cancer 
inhibitor. In another embodiment, a method of inhibiting metastatic colorectal cancer is 
provided. The method comprises administration of a metastatic colorectal cancer inhibitor. 
In a further embodiment, methods of treating cells or individuals with metastatic colorectal 
cancer are provided. The method comprises administration of a metastatic colorectal cancer 
inhibitor. 

A variety of cell growth, proliferation, and metastasis assays are known to 
those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of metastatic colorectal cancer sequences, which when expressed in host cells, 
inhibit abnormal cellular proliferation and transformation. A therapeutic compound would 
reduce or eliminate the host cells* ability to grow in stirred suspension culture or suspended 
in semi-solid media, such as semi-solid or soft 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3 rd ed., 1994), 
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herein incorporated by reference. See also, (he methods section of Gaikavtsev et al (1996), 
supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in apetri dish until 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate ab- 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
transfected with a metastatic colorectal cancer-associated sequence and are grown for 24 
hours at saturation density in non-limiting medium conditions. The percentage of cells 
labeling with (^-thymidine is determined autoradiograpbically. See, Freshney (1994), 
supra. 

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal 
counterparts (see, e.g., Temin, /. Natl Cancer InstL 37:167-175 (1966); Eagle et al, J. Exp. 
Med 131:836-879 (1970)); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth, 
in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Tumor 
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angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, eg, Fo lfrm an , Angiogenesis and Cancer, Sent Cancer Biol (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Uhkless etaL,J. Biol Chem. 249:4295-4305 (1974); 
Strickland & Beers, J. Biol Chem. 251:5694-5702 (1976); Whtnr et al, Br. J. Cancer 42:305- 
312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth, rn Biological Responses in Cancer, pp. 178-184 (Nfihich (ed.) 1985); 
Freshney Anticancer Res. 5:111-130 (1985). 

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate metastatic colorectal 
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy - 
and invasiveness of cells into Matrigel or some other extracellular matrix constituent In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with Matrigel or some 
other extracellular matrix constituent Penetration into the gel, or through to the distal side of 
the filter, is rated as invasiveness, and rated histologically by number of cells and distance 
moved, or by prelabeling the cells with l2S l and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

Effects of metastatic colorectal cancer-associated sequences on cell growths 
can be tested in transgenic or immune-suppressed mice. Knock-out transgenic mice can be 
made, in which the metastatic colorectal cancer gene is disrupted or in which a metastatic 
colorectal cancer gene is inserted. Knock-out transgenic mice can be made by insertion of a 
marker gene or other heterologous gene into the endogenous metastatic colorectal cancer 
gene site in the mouse genome via homologous recombination. Such mice can also be made 
by substituting the endogenous metastatic colorectal cancer gene with a mutated version of 
the metastatic colorectal cancer gene, or by mutating the endogenous metastatic colorectal 
cancer gene, e.g., by exposure to carcinogens. 
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A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g 9 Capecchi et aL, Science 244:1288 (1989)). Chimeric targeted mice can be 
derived according to Hogan et aL , Manipulating the Mouse Embryo: A Laboratory Manual, 
Cold Spring Harbor Laboratory (1988) and Teratocardnomas and Embryonic Stem Cells: A 
Practical Approach, Robertson, e<L, IRL Press, Washington, D.C., (1987). 

Alternatively, various immune-suppressed or immune-deficient host animals 
can be used. For example, genetically athymic "nude" mouse (see, e.g., Giovaneila et al, J. 
Natl Cancer Inst. 52:921 (1974)), a SC3D mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g, Bradley et aL, Br. J. Cancer 38:263 (1978); Selby et aL 9 Br. J. Cancer - 
41 :52 (1980)) can be used as a host transplantable tumor cells (typically about 10 6 cells) 
injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 
normal cells of similar origin will not In hosts which developed invasive tumors, cells 
expressing a metastatic colorectal cancer-associated sequences are injected subcutaneously. 
After a suitable length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by 
volume or by its two largest dimensions) and compared to the control. Tumors that have 
statistically significant reduction (using, e.g., Student's T test) are said to have inhibited 
growth. Additionally, human tumor cells expressing the genes of the invention may be 
injected into immune compromised animals. Growth of these tumors, or xenografts, is 
compared to growth of similar human tumor cell that do not express the genes of the 
invention. These animals may also be used to binding assays and efficacy studies for 
therapeutic compounds that modulate metastatic colorectal cancer, such as antibodies or 
small molecules. 

Polynucleotide modulators of metastatic colorectal cancer 

Antisense Polynucleotides 

In certain embodiments, the activity of a metastatic colorectal cancer- 
associated protein is downregulated, or entirely inhibited, by the use of antisense 
polynucleotide, i.e., a nucleic acid complementary to, and which can preferably hybridize 
specifically to, a coding mRNA nucleic acid sequence, e.g., a metastatic colorectal cancer 
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protein mRNA, or a subsequence thereof Binding of the antisense polynucleotide to the 
mRNA reduces the translation and/or stability of the mRNA 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturaUy^ccurring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
sulfur containing species which are known for use in the art Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the metastatic colorectal 
cancer protein mRNA See, eg., bis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, 
MA 

Such antisense polynucleotides can readily be synthesized using recombinant 
means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystons. The preparation of other oligonucleotides such as - - 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art 

Antisense molecules as used herein include antisense or sense 
* oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 
binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either KNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for metastatic colorectal cancer molecules. A 
preferred antisense molecule is for a metastatic colorectal cancer sequence in Tables 1-26, or 
for a ligand or activator thereof Antisense or sense oligonucleotides, according to the 
present invention, comprise a fragment generally at least about 14 nucleotides, preferably 
from about 14 to 30 nucleotides. The ability to derive an antisense or a sense 
oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, e.g., 
Stein & Cohen {Cancer Res. 48:2659 (1988 and van dor Krol et al. (BioTeckniques 6:958 - 
(1988)). 

Eibozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of metastatic colorectal cancer-associated nucleotide sequences. A 
ribozyme is an RNA molecule that catalytically cleaves other RNA molecules. Different 
kinds of ribozymes have been described, including group I ribozymes, hammerhead 
ribozymes, hairpin ribozymes, RNase P, and axhead ribozymes {see, e.g., Castanotto et aL, 
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Adv. in Pharmacology 25: 289t317 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al, 
Nucl Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
No. 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g., 
WO 94/26877; Ojwang et al, Proc Natl. Acad. Sci. USA 90:6340-6344 (1993); Yamada et 
al 9 Human Gene Therapy 1:39-45 (1994); Leavitt et al 9 Proa Natl Acad. Sci. USA 92:699- 
703 (1995); Leavitt et al. y Human Gene Therapy 5:1 151-120 (1994); and Yamada et al, 
Virology 205: 121-126 (1994)). 

Polynucleotide modulators of metastatic colorectal cancer may be introduced 
into a cell containing the target nucleotide sequence by formation of a conjugate with a ligand 
binding molecule, as described in WO 91/04753. Suitable ligand binding molecules include, 
but are not limited to, cell surface receptors, growth factors, other cytokines, or other ligands 
that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule 
does not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the celL Alternatively, a polynucleotide modulator of 
metastatic colorectal cancer may be introduced into a cell containing the target nucleic acid 
sequence, e.g., by formation of an polynucleotide-lipid complex, as described in WO 
90/10448. It is understood that the use of antisense molecules or knock out and knock in 
models may also be used in screening assays as discussed above, in addition to methods of 
treatment. 

Thus, in one embodiment, methods of modulating metastatic colorectal cancer 
in cells or organisms are provided. In one embodiment, the methods comprise administering 
to a cell an anti-metastatic colorectal cancer antibody that reduces or eliminates the biological 
activity of an endogenous metastatic colorectal cancer protein. Alternatively, the methods 
comprise administering to a cell or organism a recombinant nucleic acid encoding a 
metastatic colorectal cancer protein. This may be accomplished in any number of ways. In a 
preferred embodiment, e.g., when the metastatic colorectal cancer sequence is down- 
regulated in metastatic colorectal cancer, such state may be reversed by increasing the 
amount of metastatic colorectal cancer gene product in the cell. This can be accomplished, 
e.g., by overexpressing the endogenous metastatic colorectal cancer gene or administering a 
gene encoding the metastatic colorectal cancer sequence, using known gene-therapy 
techniques. In a preferred embodiment, the gene therapy techniques include the 
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incorporation of the exogenous gene using enhanced homologous recombination (EHR), e.g., 
as described in PCT/US93/03868, hereby incorporated by reference in its entirety. 
Alternatively, e.g., when the metastatic colorectal cancer sequence is up-regulated in 
metastatic colorectal cancer, the activity of the endogenous metastatic colorectal cancer gene 
is decreased, e.g., by the administration of a metastatic colorectal cancer antisense nucleic 
acid. 

In one embodiment, the metastatic colorectal cancer proteins of the present 
invention may be used to generate polyclonal and monoclonal antibodies to metastatic 
colorectal cancer proteins. Similarly, the metastatic colorectal cancer proteins can be 
coupled, using standard technology, to affinity chromatography columns. These columns 
may then be used to purify metastatic colorectal cancer antibodies useful for production, 
diagnostic, or therapeutic purposes. In a preferred embodiment, the antibodies are generated 
to epitopes unique to a metastatic colorectal cancer protein; that is, the antibodies show little 
or no cross-reactivity to other proteins. The metastatic colorectal cancer antibodies may be 
coupled to standard affinity chromatography columns and used to purify metastatic colorectal 
cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the metastatic colorectal cancer protein. 

Methods of identifying variant metastatic colorectal cancer-associated sequences 

Without being bound by theory, expression of various metastatic colorectal 
cancer sequences is correlated with metastatic colorectal cancer. Accordingly, disorders 
based on mutant or variant metastatic colorectal cancer genes may be determined. In one 
embodiment, the invention provides methods for identifying cells containing variant 
metastatic colorectal cancer genes, e.g., determining all or part of the sequence of at least one 
endogenous metastatic colorectal cancer genes in a cell. This may be accomplished using ^ 
any number of sequencing techniques. Li a preferred embodiment, the invention provides 
methods of identifying the metastatic colorectal cancer genotype of an individual, e.g., 
determining all or part of the sequence of at least one metastatic colorectal cancer gene of the 
individual. This is generally done in at least one tissue of the individual, and may include the 
evaluation of a number of tissues or different samples of the same tissue. The method may 
include comparing the sequence of the sequenced metastatic colorectal cancer gene to a 
known metastatic colorectal cancer gene, i.e., a wild-type gene. 

The sequence of all or part of the metastatic colorectal cancer gene can then be 
compared to the sequence of a known metastatic colorectal cancer gene to determine if any 
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differences exist This can be done using any number of known homology programs, such as 
Bestfit, etc. In a preferred embodiment, the presence of a difference in the sequence between 
the metastatic colorectal cancer gene of the patient and the known metastatic colorectal 
cancer gene correlates with a disease state or a propensity for a disease state, as outlined 
herein. 

In a preferred embodiment, the metastatic colorectal cancer genes are used as 
probes to determine the number of copies of the metastatic colorectal cancer gene in the 
genome. 

In another preferred embodiment, the metastatic colorectal cancer genes are 
used as probes to determine the chromosomal localization of the metastatic colorectal cancer 
genes. Information such as chromosomal localization finds use in providing a diagnosis or 
prognosis in particular when chromosomal abnormalities such as translocations, and the like 
are identified in the metastatic colorectal cancer gene locus. ~ 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a metastatic colorectal 
cancer protein or modulator thereof is administered to a patient By 'therapeutically 
effective dose" herein is meant a dose that produces effects for which it is administered. The 
exact dose will depend on the purpose of the treatment, and will be ascertainable by one 
skilled in the art using known techniques (e.g., Ansel et al, Pharmaceutical Dosage Forms 
and Drug Delivery; Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, 
ISBN 0824770846, 082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and 
Technology of Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations 
(1999)). As is known in the art, adjustments for metastatic colorectal cancer degradation, 
systemic versus localized delivery, and rate of new protease synthesis, as well as the age, - 
body weight, general health, sex, diet, time of administration, drug interaction and the 
severity of the condition may be necessary, and will be ascertainable with routine 
experimentation by those skilled in the art 

A patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 

The administration of the metastatic colorectal cancer proteins and modulators 
thereof of the present invention can be done in a variety of ways as discussed above, 
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including, but not limited to, orally, subcutaneously, intravenously, intranasally, 
transdermally, intraperitoneally, intramuscularly, intrapufanonary, vaginally, rectally, or 
intraocularly. In some instances, e.g., in the treatment of wounds and inflammation, the 
metastatic colorectal cancer proteins and modulators may be directly applied as a solution or 
spray. 

The pharmaceutical compositions of the present invention comprise a 
metastatic colorectal cancer protein in a form suitable for administration to a patient In the 
preferred embodiment, the pharmaceutical compositions are in a water soluble form, such as 
being present as pharmaceutically acceptable salts, which is meant to include both acid and 
base addition salts- "Pharmaceutically acceptable acid addition salt" refers to those salts that 
retain die biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic arid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred arc the aJifrnto niiTmj 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amine s, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the 
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that metastatic colorectal cancer protein modulators (e.g., 
antibodies, antisense constructs, ribozymes, small organic molecules, etc.) when administered 
orally, should be protected from digestion It is also recognized that, after delivery to other 
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sites in the body (e.g., circulatory system, lymphatic system, or the tumor site) the metastatic 
colorectal cancer modulators of the invention may need to be protected from excretion, 
hydrolisis, proteolytic digestion or modification, or detoxification by the liver, hi all these 
cases, protection is typically accomplished either by complexing the molecule(s) with a 
composition to render it resistant to acidic and enzymatic hydrolysis, or by packaging the 
molecule(s) in an appropriately resistant carrier, such as a liposome or a protection barrier or 
by modifying the molecular size, weight, and/or charge of the modulator. Means of 
protecting agents from digestion degradation, and excretion are well known in the art 

The compositions for administration will commonly comprise a metastatic 
colorectal cancer protein modulator dissolved in a pharmaceutical^ acceptable carrier, 
preferably an aqueous carrier. A variety of aqueous carriers can be used, e.g., buffered saline 
and the like. These solutions are sterile and generally free of undesirable matter. These 
compositions may be sterilized by conventional, well known sterilization techniques. The^ 
compositions may contain pharmaceutical^ acceptable auxiliary substances as required to 
approximate physiological conditions such as pH adjusting and buffering agents, toxicity 
adjusting agents and the like, e.g., sodium acetate, sodium chloride, potassium chloride, 
calcium chloride, sodium lactate and the like. The concentration of active agent in these 
formulations can vary widely, and will be selected primarily based on fluid volumes, 
viscosities, body weight and the like in accordance with the particular mode of administration 
selected and the patient's needs (e.g., Remington 's Pharmaceutical Science (15th ed, 1980) 
and Goodman & Gillman, The Pharmacologial Basis of Therapeutics (Hantaan et al 9 eds., 
1996)). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 
patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
in the art, e.g., Remington 's Pharmaceutical Science and Goodman and Gillman, The 
Pharmacologial Basis of Therapeutics, supra. 

The compositions containing modulators of metastatic colorectal cancer 
proteins can be administered for therapeutic or prophylactic treatments. In therapeutic 
applications, compositions are administered to a patient suffering from a disease (e.g., a 
cancer) in an amount sufficient to cure or at least partially arrest the disease and its 
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complications. An amount adequate to accomplish this is defined as a "therapeutically 
effective dose." Amounts effective for this use will depend upon the severity of the disease 
and the general state of the patient's health. Single or multiple administrations of the 
compositions maybe administered depending on the dosage and frequency as required and 
tolerated by the patient In any event, the composition should provide a sufficient quantity of 
the agents of this invention to effectively treat the patient An amount of modulator that is 
capable of preventing or slowing the development of cancer in a mammal is referred to as a 
'^rophylactically effective dose" The particular dose required for a prophylactic treatment 
will depend upon the medical condition and history of the mammal, the particular cancer 
being prevented, as well as other factors such as age, weight, gender, administration route, 
efficiency, etc. Such prophylactic treatments may be used, e.g., in a mammal who has 
previously had cancer to prevent a recurrence of the cancer, or in a mammal who is suspected 
of having a significant likelihood of developing cancer. - 

It will be appreciated that the present metastatic colorectal cancer protein- 
modulating compounds can be administered alone or in combination with additional 
metastatic colorectal cancer modulating compounds or with other therapeutic agent, e.g., 
other anti-cancer agents or treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-26, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of metastatic colorectal cancer- 
associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g, Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 (Berger), Ausubel et al. 9 eds., Current Protocols (supplemented through 1999), 
and Sambrook et aL 9 Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989. 

hi a preferred embodiment, metastatic colorectal cancer proteins and 
modulators are administered as therapeutic agents, and can be formulated as outlined above. 
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Similarly, metastatic colorectal cancer genes (including both the full-length sequence, partial 
sequences, or regulatory sequences of the metastatic colorectal cancer coding regions) can be 
administered in a gene therapy application. These metastatic colorectal cancer genes can 
include antisense applications, either as gene therapy (Le., for incorporation into the genome) 
or as antisense compositions, as will be appreciated by those in the art 

Metastatic colorectal cancer polypeptides and polynucleotides can also be 
administered as vaccine compositions to stimulate HTL, CTL and antibody responses.. Such 
vaccine compositions can include, e.g., lipidated peptides (see, e.g.,Vitiello, et al 9 J. Clin. 
Invest 95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) 
("PLG") microspheres (see, e.g, Eldridge, et al. 9 Molec Immunol. 28:287-294, (1991); 
Alonso etal., Vaccine 12:299-306 (1994); Jones et al. 9 Vaccine 13:675-681 (1995)), peptide 
compositions contained in immune stimulating complexes (ISCOMS) (see, e.g, Takahashi et 
al 9 Nature 344:873-875 (1990); Hu et al, Gin Exp Immunol 1 13:235-243 (1998)), multiple 
antigen peptide systems (MAPs) (see, e.g. 9 Tarn, Proc. Natl Acad. Sci. U.SA. 85:5409-5413 
(1988); Tarn, J. Immunol Methods 196:17-32 (1996)), peptides formulated as multivalent 
peptides; peptides for use in ballistic delivery systems, typically crystallized peptides, viral 
delivery vectors (Perkus, et al 9 In: Concepts in vaccine development (Kaufinann, ed., p. 379, 
1996); Chakrabarti, et al 9 Nature 320:535 (1986); Hu et al 9 Nature 320:537 (1986); Kieny, 
et al , AIDS Bio/Technology 4:790 (1986); Top et al 9 J. Infect Dis. 124:148 (1971); Chanda 
et al 9 Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g. 9 Kofler et al 9 
J. Immunol Methods. 192:25 (1996); Eldridge et al. 9 Sem. Hematol 30:16 (1993); Falo et al 9 
Nature Med. 7:649 (1995)), adjuvants (Warren et al. 9 Annu. Rev. Immunol 4:369 (1986); 
Gupta et al 9 Vaccine 11:293 (1993)), liposomes (Reddy et al 9 J. Immunol 148:1585 (1992); 
Rock, Immunol Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, et al 9 
Science 259:1745 (1993); Robinson et al. 9 Vaccine 1 1 :957 (1993); Shiver et al 9 In: Concepts 
in vaccine development (Kaufinann, ed, p. 423, 1996); Cease & Berzofsky, Annu. Rev. 
Immunol 12:923 (1994) and Eldridge et al 9 Sem. Hematol. 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, hie. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 
or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
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MQ; Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukm-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or 
RNA encoding one or more of the polypeptides, or a fragment thereof is administered to a 
patient This approach is described, for instance, in Wolff eL a/., Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,1 18; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, ~ 
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
(see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 
include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode metastatic 
colorectal cancer polypeptides or polypeptide fragments. Upon introduction into a host, die 
recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits an 
immune response. Vaccinia vectors and methods useful in immunization protocols are 
described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette 
Guerin). BCG vectors are described in Stover et aL, Nature 351 :456-460 (1991). Awide 
variety of other vectors useful for therapeutic administration or immunization e.g., adeno and 
adeno-associated virus vectors, retroviral vectors, Salmonella typhi vectors, detoxified 
anthrax toxin vectors, and the like, will be apparent to those skilled in the art from the 
description herein (see, e.g., Shata et aL 9 MolMed Today 6:66-71 (2000); Shedlock et al 9 J 
Leukoc Biol 68:793-806 (2000); Hipp et al., In Vivo 14:571-85 (2000)). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing a metastatic colorectal cancer gene or portion of a metastatic colorectal cancer gene 
under the control of a regulatable promoter or a tissue-specific promoter for expression in a 
metastatic colorectal cancer patient The metastatic colorectal cancer gene used for DNA 
vaccines can encode full-length metastatic colorectal cancer proteins, but more preferably 
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encodes portions of the metastatic colorectal cancer proteins including peptides derived from 
the metastatic colorectal cancer protein. In one embodiment, a patient is immunized with a 
DNA vaccine comprising a plurality of nucleotide sequences derived from a metastatic 
colorectal cancer gene. For example, metastatic colorectal cancer-associated genes or 
sequence encoding subfragments of a metastatic colorectal cancer protein are introduced into 
expression vectors and tested for their immunogenicity in the context of Class I MHC and an 
ability to generate cytotoxic T cell responses. This procedure provides for production of 
cytotoxic T cell responses against cells which present antigen, including intracellular 
epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the metastatic colorectal cancer polypeptide encoded 
by the DNA vaccine. Additional or alternative adjuvants are available. " 

In another preferred embodiment metastatic colorectal cancer genes find use in 
generating animal models of metastatic colorectal cancer. When the metastatic colorectal 
cancer gene identified is repressed or diminished in metastatic tissue, gene therapy 
technology, e.g., wherein antisense RNA directed to the metastatic colorectal cancer gene 
will also diminish or repress expression of the gene. Animal models of metastatic colorectal 
cancer find use in screening for modulators of a metastatic colorectal cancer-associated 
sequence or modulators of metastatic colorectal cancer. Similarly, transgenic animal 
technology including gene knockout technology, e.g., as a result of homologous 
recombination with an appropriate gene targeting vector, will result in the absence or 
increased expression of the metastatic colorectal cancer protein. When desired, tissue- 
specific expression or knockout of the metastatic colorectal cancer protein may be necessary. 

It is also possible that the metastatic colorectal cancer protein is overexpressed 
in metastatic colorectal cancer. As such, transgenic animals can be generated that 
overexpress the metastatic colorectal cancer protein. Depending on the desired expression 
level, promotes of various strengths can be employed to express the transgene. Also, the 
number of copies of the integrated transgene can be determined and compared for a 
determination of the expression level of the transgene. Animals generated by such methods 
find use as animal models of metastatic colorectal cancer and are additionally useful in 
screening for modulators to treat metastatic colorectal cancer. 



80 



WO 02/068677 



PCT/US02/06001 



Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, metastatic colorectal cancer- 
specific nucleic acids or antibodies, hybridization probes and/or primers, antisense 
polynucleotides, ribozymes, dominant negative metastatic colorectal cancer polypeptides or 
polynucleotides, small molecules inhibitors of metastatic colorectal cancer-associated 
sequences etc. A therapeutic product may include sterile saline or another pharmaceutical^ 
acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 
(i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is - 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
like. Such media may include addresses to internet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
metastatic colorectal cancer-associated sequences. Such kits can be prepared from readily 
available materials and reagents. For example, such kits can comprise one or more of the 
following materials: a metastatic colorectal cancer-associated polypeptide or polynucleotide, 
reaction tubes, and instructions for testing metastatic colorectal cancer-associated activity. 
Optionally, the kit contains biologically active metastatic colorectal cancer protein. A wide 
variety of kits and components can be prepared according to the present invention, depending 
upon the intended user of the kit and the particular needs of the user. Diagnosis would - 
typically involve evaluation of a plurality of genes or products. The genes will be selected 
based on correlations with important parameters in disease which may be identified ip. 
historical or outcome data. 
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ESTs 175 

ESTs;HigWysirniiartopufnTonarysifffK 175 

Mucm(GfcM57417) 3.75 

Hs.186550 ESTs 173 

Hs.10615 ESTs 17 

Hs.94304 EST 358 

Hs.122576 ESTs 356 

Hs.180911 ribosomaJprotehS4;Y4Mced 162 

Hs.187571 ESTs 16 

Hs.183852 ESTs;WeaJdys^topdyrnerase[asa 158 
Hs59766 H sapiens mRNA; cDKA DKFZp564J0323 (from 

Hs.91608 otoferfin 353 

Hs.1 58084 peroxisome receptor 1 351 

Hs.124724 EST3; WeaWy similar to coded for by C. 147 

Hs.122489 ESTs 146 

H&2Q3910 small glufemlne-rich tetratricopeptide r 145 

Hs.19081 ESTs 143 

H&59908 nuclear receptor aactivator 4 14 
zm24g9.s1 Stratagem pancreas (#93728) H 
mRNAseq 



14 

Ks25199 ESTs; HfeWysBrilar to match to ESTs AA 138 
Hs304238 ^ 
Hs.155597 



138 
137 



fipocaSn 2 (oncogene 24p3) 
D component of complement (adipsm) 

y|76d5^l Soanra breast 2NbHBst H sapten 136 

Hs£05126 polymencimmunogjrjbi^ receptor 135 

Hs.191466 ESTs 351 

Ks247433 aaSvatmgtrarecripfion factor 6 3.19 

Hs.16304 ESTs 3.18 
Hs.789 GROI oncogene (melanoma growth sfimuiafi 3.16 



EB cells, KT29 eels, HMEC 

HMEC (total RNA), Fibroblasts 2, Fibroblasts 2 

Fibroblasts 2, MB231_cefls, M&WDA453 

EB_ceUs, OVCARjato, HS578T_ceS3 

HMEC (total RNA), HMEC, Fibroblasts 2 

HMEC, HMEC (total RNA), Fibroblasts 2 

HRCC. HMEC (total RNA), UiJDjm 

BT474_cefls, FBxobiasts 2, hB231_cefls 

OVCAfLcells, MCF7, 293Tjsfe 

Fibroblasts 2, HNEC. HT29jceis 

HS578T_ceite, HMEC (total RNA), HMEC 

EB.cefe, HT29_cefls, MB231_cels 

HNEC (total RNA), HMEC, EBjsfe 

HS578T_cefls, HMEC, Lu_SQ_H520 

HMEC, HS578T_cefe, BT474_cefe 

HA£C, HS578Tjcefis, Caco2 

HMEC. HMEC (toted RNA), HS578T_ce03 ~ 

HMEC, HMEC (total RNA), MB-MDA-435s - 

HMEC, EB.cete, HMEC (total RNA) 

HMEC (total RNA), HMEC, Fibroblasts 2 

4.04 HMEC (total RNA), HMEC, Fibroblasts 2 

EB__cefls, HMEC, HMEC (total RNA) 

HMEC (total RNA), HMEC. Fixubtasts2 . 

HMEC, HMEC (total RNA), EB_cefe 

HMEC (total RNA), HMEC, FHxoblasts2 

EB_ce!ls, HMEC (total RNA), HMEC 

HMEC (total RNA), HMEC, Fibroblasts 2 

PC3jceIls, RPW^2, Caco2 

EBjceOs, HMEC, HMEC (total RNA) 

EB.cefls, HMEC, HMEC (total RNA) 

HS578T.ceQs, M&M)A-435s, KT29_ceQs 

HS578T_cdt3, HMEC, LNCaPjceQs 

DU145__ce&, LilADlH23. MB231_ce0s 

mCaP_cete, U.SCJC45, DU145_cefe 

HS578T_celts, A549_cefe, CALU6_ceB3 

HMEC (total RNA), HMEC, BT474_ceQs 

293Tceto, PRSC_con, KT29_cefis 

HMEC (total RNA). HMEC, Fttxob!asts2 

HS578T_cefl3, 293T_ceBs, HMEC 

HMEC. HS578T_cefe, Caco2 

HNEC. HS576T_ceas, OVCAR_cefe 

EB.ceSs, MCF7, LNCaP.ceOs 

DU145_ce&, RPWEJ2. A549jceb 

HMEC (total RNA), Fibrobtasfa 2, Rbrobbsis2 - 

HMEC (total RNA), HMEC, Fibroblasts 2 

356 HMEC, HMEC (total RNA), EB.ceOs 

HMEC (total RNA). HMEC, FIbiobbsts2 

HMEC, HMEC (total RNA), EB.ceQs 

HMEC (total RNA), HMEC, EBjceOs 

LNCaP^cete, MCF7, DU145„ceb 

EB.cefls, HMEC, HMEC (total RNA) 

HS578T_ceDs, EBjceOs, LiLADJi23 

HS578TjceD3, EB^cete, HMEC 

EB.ceiis, HMEC, HMEC (total RNA) 
EB^cete, LNCaP_ce»s, RPWL2 
PC3_ce!b, EBjxIls, HT29_ce0s 
PRSC^con, ffi.ceDs, Lu^ADJi23 
EBjs&s, HMEC, HMEC (total RNA) 
MB231jceib, HT29_cell3, lu_SCJffi9 
EBjceOs. HMEC, HMEC (total RNA) 
HMEC (total RNA), HMEC, Lu_AD_H23 
HMEC. MB231_C8fls, Caco2 
UOCH460, PC3joeJls, FtoWasts2 
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109374 AA218727 Hi2107B5 ESTs; Highly staDar to Ibd1|Hsapfen^ 3.13 

131403 RK750 Ha26455 ESTs 3.13 

11342) T83964 Hs.15400 ESTs 3J1 

112532 R69824 H&28313 ESTs 111 

1179(5 N50782 H&231713 EST 3.11 

125349 TB7826 Hs. 164460 ESTs 3.1 

107072 AA609113 Hs.177533 H sapiens mRNA;cDNADKFZp5^0318 (from 

118389 N64583 Hs.182385 ESTs 335 

117653 N38970 Hs.194214 ESTs 334 

101082 L05072 Hs30645 interferon regiiJatDfyferiorl 304 

126105 H75323 Hs.167614 ESTs 333 

120006 W90108 Hs.10848 K!AA01B7gene product 103 

127191 AA297581 EST1 13160 GaDUaiMer I H sapiens cDNA 3.02 



106899 AA490107 Hs21753 JM5 protein 
112784 R96306 Hs.191290 ESTs 

113613 793337 Hs.17167 ESTs; Highly sfmOartoLKRFU-IWBfa 
ESTs 

HNL=nortroph3 ipocalln [human, ovarian 
Brfa-l-Giycoprotefn 11, Pregnancy-Specff 
dc*nregulated h ovarian cancer 1 
ESTs 



107631 AA007230 Hs35028 
101923 S75256 
100695 HG315T 

102523 U53445 Hs.15432 

121588 AA416615 Hs38242 

103714 AA047055 Hs.192943 ESTs 

104916 AA056588 Hs.16542 ESTs 

109928 H05961 Hs^6331 



ESTs 
ESTs 

coaguJd&jfifectorX 
.. . . .. carboi& anftydrase (V 

Ks.109108 solute c ante r famiy 22 (organic cafion 
Hs27038 ESTs; Moderately slmfer to X-toked ret 
ESTs 
ESTs 



HsJ20787 
Hs.47913 
Hs39485 



3JQ2 

332 
332 
ao2 
m 

3.01 
238 
234 
234 
233 
232 
232 
231 
23 
23 
239 
238 
238 



104586 R78309 
101236 L29433 
134749 L10955 
124703 R07294 
114108 Z38431 
107857 AA024687 Hs.61208 
111586 R10759 Hs.15177 

127553 AA282433 H sapiens p60 katan&i mRNA; compfete ctfa 237 

129881 AA458952 Hs.197728 ESTs; WeaWy similar to ZINC FINGER PRCT ZB6 

116852 H65459 Hs38323 ESTs 235 

133468 X03088 Hs.73931 major htoxHnpat^ comptex; cbss 232 

130998 C00810 Kfe-21370 guanine nucfectjde binding protein (G pr 232 

124075 H05741 Hs.101643 ESTs 232 

128108 A1247422 Hs.129966 ESTs 232 

128036 R15413 Hs.164919 ESTs; Highly similar to PROTEIN KINASE C 23 
126619 Z28861 HS8A7E032 STKATAGENE Human skeletal muse 

cDNAdoneA7E03,mRNAseq. 2.77 

114418 AA011383 Hs.177313 ESTs 277 

120383 AA228O30 Hs. 120234 ESTs 2.77 

126535 H73017 Hs250723 ESTs; WeaWy similar to atraphin-1 retet 2.76 

yclOoMsl Stratagene lung (#937210) H s 276 

Hs.141438 ESTs; Moderately simfer to simSar to C 2.76 
Hs.169355 ESTs; WeaWy stmfiarto TRANSCRIPTION RE 



119347 T64349 
126219 N36368 
125426 R43963 
103005 X52008 



Hs^700 glycine receptor; alpha 2 2.74 

109170 AA180352 Hs.191472 ESTs 274 

101125 L10373 Hs.82749 trarerrefrtoe4superfarr^nTen^2 2.73 

130656 Z20481 Hs.17411 KIAA0699 protein 2.73 

122933 AA476728 Hs.107537 ESTs 2.72 
126033 AA055978 Hs3807 ESTs; Weakly sknflar to PHOSPHOLEMMAN PR 

111644 R16539 Hs.223649 EST; Moderately srrnflar to Cd-7 Metalo 2.71 

133719 AA033790 Ks.75736 apoffpoprotefn D 2.71 

127555 AA582324 Hs.192857 ESTs 2.7 

113321 T70580 Hs.13759 ESTs 239 

103326 AA210719 Hs.86414 ESTs 268 

135003 H42527 Hs32832 ESTs 238 

103650 Z7O220 H^apterarnRrMfor5UTRfbrur^ 268 

111507 R07728 Hs.191218 ESTs 267 

117084 H93081 Hs.41829 ESTs 267 

103975 AA306264 Hs.176403 ESTs; Moderately similar to Ml ALU SUB 267 

132850 R89741 Hs38215 ESTs; Moderately similar to rhotekm[M 266 

121599 AA416770 Hs38255 EST 231 

124230 H63111 Hs.6555 ESTs 23 

114174 Z39055 Hs27264 ESTs; Moderately sfrnilar to ill! ALU SUB 238 

128469 T23724 H&258677 EST 237 

117399 N26480 Hs.43805 SpornaH^K;fostonpartoer-ae3 237 

129279 AA460551 Hs.184860 ESTs; WeaMy sfen3arto EG37B13 IDjnel 237 

119817 W74257 Hs.153690 ESTs 257 

114445 AA019594 Hs.250493 ESTs; Wealdy similar to KIAA03S0 Irisapi 236 

120651 AA287286 Hs39657 ESTs 235 

105707 AA291012 Hs37617 ESTs; WeaWy sfatibr to KIAA0727 proton 235 

128483 T58588 Hs3148 RN»o^ product 254 
125890 AA448739 Hs.116708 ESTs; Wealdy sbnibrto HYPOTHETICAL PRO 



Caco2, A549jdbQs» MB23Lcefe 

HS578T_cefis, HNEC MB23l_ceDs 

HMEC (total RNA), HMEC, S_ceSs 

HMEC HJ€C (total RNA), EELceBs 

HMEC HS578T_ce2s, Cacd2 

HS578Tce&, EB_cefls, MB-MDA435S 

11 LilSC_H59, MBMDM53, MB231_ce2s 

HMEC, HMEC, LNCaP_ce2s 

HMEC, HMEC (tobl RNA), FSnoblaste 2 

EELceQs, PRSC con, DU145_cefls 

HS578T_ce03, HMEC (total RNA), HMEC 

HMEC, HMEC (trial RNA), S.ceDs 

HMEC, Lu_ADlH23, LxlSQJ62D 

EB^ceDs. HMEC (total RNA), HMEC 

EB ceDs, HMEC Ui_AD 358 

HMEC (total RNA), EB.ce&s, HMEC 

LLSCJB45, HS578T_ce!ls, Lu_LCJW60 

PC3_ceQs, ffijcefe, HT29js& 

FfcrcWas!s2. hiJDJQZ, K©-MDA-435s 

PRSCcon, FBjrofaIast32, HMEC 

HMEC HS578T_ceas, BT474_ceHs 

HS578TjceBs, EBjcete, HMEC 

HMEC (total RNA), RbroWasfe2, HMEC 

HMEC MB231_cefls, HS578T ceBs 

CacoZ LiljAD^363, LuJ\D_358 

HfiffiC, HS578T_ce&, Caco2 

BT474_cete, MCF7, HMEC (total RNA) 

HMEC HMEC(tofaIRNA), h»WDM35s ~ 

HMEC HMEC (total RNA}, EB cefe 

HS578T_ceDs, MB231_cefe, H&EC 

HS578T_ceBs, Ul_LC JW60, PRSC^con 

EBjdbDs, MB-MDA-435S, RPWE_2 

EB.cefe, PC3_cefe, HMEC 

HMEC, Caco2. HS578T_ceDs 

MB-MDA435S, BT474_cefls, Hra_cefe 

LNCaP.cefls, Lu_SCJC45. EB.cefe* 

HMEC HS578T_celIs, Hra.ceBs 

HS578T_cens, U1J.CJW6O, Lu_SCJi69 

MB231_co3s. LuJDJ&Z, RPW^2 

HMEC UlAD_H23. HMEC (total RNA) 
HS578T_cefls, EB_ceQs, MCF7 
EBjceDs, Fibroblasts 2, HMEC (total RNA) 
Rbrctlasts2. PRSC.con, DU145_ceiIs 
EBjoeOs, hLfiDjm, U.SC_H69 
Lu_AP_H23, HMEC (total RNA), MB^DA435s 
275 HhCC, HAflEC (total RNA), Lu_SC_H69 
HS578T_ceDs, HMEC M&MDA453 
Rbrobbsts2, HMEC (total RNA), MB-MDA-435s 
LuJ-CJ+WO, 293T.cefis, EB_cefls 
HVEC{tofalRNA), HMEC, Rbroblasl32 
HMEC EB.cefe, HMEC (total RNA) 
271 UlSC_H345. ULSCH69, 293Lcefls 
EBjceUs, HMEC, HACC (total RNA) 
Caco2, Fa»oWas1s2, MBWU35s 
HMEC HS578T_ce0s, HMEC (total RNA) 
mEC (total RNA). FB)robbsts2 f PRSC con 
MB-MDA435S, HS578T_ceSs, Lu_SC_H© 
HS578T_ceDs, EBjxBs, PRSC.con 
HMEC, HS578T_ceHs, PRSCjoon 
HMEC (total RNA), HMEC EB.ceSs 
HS578T_ceb, HMEC, R«23Lcefe 
DU145j3Bb, HS578Tjcells, MB^VlDA-435s 
HS578T_ceSs, EB_cefls, 293T^ceCs 
HMEC (total RNA), HMEC. B^ceHs 
HMEC (total RNA), HMEC, Fibroblasts 2 
Caco2, MB-MDA-453, A549_ceQs 
LuJjC_H460, UJ_SC_H59, MBA©A-435s 
HMEC HMEC (total RNA), B_ceOs 
HS578T_ceb, EBjcefe, HT29jsfe 
HMEC HMEC (total RNA), Ui,SCJ«9 
HMEC Hi79_cefls. LuJ_C_#46Q 
HMEC HMEC (total RNA), Ftoobbsts2 
HMEC (total RNA), EBjsBs, BT474.ceSs 
HMEC, HS578T_ceBs, MB231 cefls 
234 Hfi£C (total RNA), HMEC, OVCARjceQs 
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134764 M74715 Hs.89560 Wuronidase; alpha-L- 154 

113404 T82323 Hs.70337 fannmiHTogtobutosupejfam^ 254 

129128 AA423854 Hs.1 08312 ESTs 254 

101428 M19684 Hs.1 84929 protease tnhftxtorl (a^l-antitrypsl 254 

103208 X72755 Hs.77367 monokine induced by gamma Interferon 253 

' 132273 AA489716 Hs.43558 DKRP588U51 protein 253 

108392 AA075124 zm86a1.s1 Stratagene ovarian cancer (#93 

IMAGES44776^mRNAseq 2.52 

119338 W37895 Hs.45519 ESTs 252 

109828 F13763 Hs.19827 ESTs 252 

135096 N89775 Hs.132390 zinc finger proteto 38 (KOX 18) 251 

130860 U66081 H&241395 protease; serine; 1 (trypsin 1) 251 

105725 AA292228 Hs.199791 STAT Induced STAT Inhibitor 3 251 

110427 H48579 Hs56275 EST 251 

123762 AA610O13 Hs.244553 EST 251 

128406 AA034096 z»6f05.r1 Soare3_fetayver.spteen_1NF 

IMAGE430017 5\ mRNA seq. 25 

13751 AA348065 Hs.1 11286 WAA0714 protein 25 

121704 AA418743 Hst98306 ESTs 25 

112585 R77783 Hs22404 protease; serine; 12 (wwbypsftmoto 25 

108499 AA083103 zn1 b12s1 Strafegene hNT neuron (#937233 

IMAGES^, mRNA seq 25 

131968 AA151333 Hs.36029 ESTs; Highly sknflar to bask; he&t-toop 25 

112665 R85661 H&221447 ESTs 248 

115764 AA421562 H&91011 anterior gradient 2 (Xeeepus laevfe) hom 248 

105959 AA405540 Hs.7001 ESTs 2M 

125804 R79519 H&16899 ESTs 248 

110102 H16681 Hs.180950 guanine nucteottte bending protein (G pr 246 

104680 AA009809 Hs57599 ESTs 246 

132339 D80O30 Hs.45127 chondrotBn sulfate proteoglycan 5 (neur 245 



121712 AA419116 Hs.1 93663 ESTs; WeaMy similar to HJl ALU SUBFAM1 245 
13226 M96843 Hs/i 80919 mNbSorof DNAbWlng2;Axn]nanlneg 244 
128731 AF005271 Hs.104555 neuropeptide FF^amJde pepftte precursor 243 



106670 AA461174 Hs5943 ESTs 243 

119306 T26914 Hs.1 32785 EAP30subunit of ELL complex 243 

133507 X74295 Hs.74369 integrin; alpha 7 242 

125713 AA367905 Hs.77356 transfenin receptor (p90; CD71) 2.41 

107438 W27841 Hs.17118 ESTs; WeaWy similar to B00252 [C.etega 241 

101784 M83186 Hs.1 14346 cytochrc^coxhtesesubuntVllapdyri 241 

134578 AA1S4724 Hs.182418 endonudease G - 24 

1E105 T95642 Hs.189759 ESTs 24 

127087 AA380418 Hs.88012 SHP2 interacting transmembrane adaptor 24 

113118 T47906 Hs220512 ESTs 239 

104791 AAQ29046 Hs.30377 ESTs; Moderately stmDarto cAMP rofocto 239 

115833 AA428269 Hs.125035 ESTs 238 

132223 R77451 Hs.4245 ESTs; WeaJclysIm2artosim2artoS.ee 238 

115836 AA428863 Hs.89388 ESTs 238 

101891 S45630 Hs.1940 aystalBn; alpha B 238 

132894 D82422 Hs5944 ESTs 237 

106939 AA498048 Hs26570 ESTs 235 

131104 W27770 Hs258721 ESTs 235 

122355 AA443789 Hs.189324 ESTs 234 
119343 T62873 yc3dis1 Stratagene king (#93721) H sapi 

to oontatos Atu repetitive elements mR 234 

115442 AA284722 H&89121 H sapiens mRNA; duomoscrne 1 specific tr 253 

134286 T69384 Hs58398 period (Drosophfla) hranobgl 233 

125465 AI375276 Hs.158732 ESTs 233 

127449 AI421866 Hs.75722 ftoophorinfl 233 

110225 H23927 Hs.222381 ESTs 233 

119930 W86471 Hs.1 51 624 hypocrefin (orexln) receptor 2 232 

125958 AI073357 Hs.1 2311 H sapiens done 23570 mRNA seq 232 

119746 W7Q279 Hs221189 ESTs; Weakry similar to 15-HYDROXYPROSTA 

108874 AA134112 Hs.1 07157 H sapiens DNA seq from cosmid ICK0721Q o 

L12 LIKE protein to an Irrtron of the HS 232 

127368 AA434362 Hs.193326 ESTs 232 

120437 AA243427 Hs.1 04311 ESTs 232 

119867 W80852 H525Q69S KDEL (Lys-Asp-Ou-leu) endoplasmic refc 232 

131205 J02947 Ks£420 superoxkie cfismutase 3; extracellular 232 

133710 X76057 Hs.75694 mannose phosphate Jsomerase 231 

104834 AA039331 Hs.1 6323 ESTs; WeaMy similar to GAGE-7 [risapen ■ 231 

113186 T56048 Hs.183674 ESTs 231 

113462 TB6826 Hs.142528 ESTs 231 

104743 AA021157 Hs53619 ESTs 25 

129667 Y00097 Hs.1 18796 aimaxfnA6 23 

111573 R10305 Hs.185683 ESTs 25 

117523 N32626 Hs.145532 ESTs; WeaWy similar to Gag polyproteto 229 



BT474_cefls, PRSCcon, HT29jcefls 
Caccfi, HS578Tjcete, HMEC 
BT474_cefe, MB-MDA-435S, HMEC 
HMEC, HT29_cefls, HMEC (total RNA) 
FtorobJasts2. MB231jsfis, HMEC (total RNA) 
EB.cete, HMEC, HMEC (toteJ RNA) 

HMEC (totel RNA}, HMEC. HS578T_ceHs 
LiCSCJ«9, CALUB^cefis, 293T_cefls 
PRSCJog, PRSCjcon. HS578T„ceQs 
HMEC, HS578T_ceSs f HT29_ce3s 
OVCAFLceSs, MB231_celis, PC3_c*fls 
HS578T_ce0s, HT29,cefls t HMEC 
HS578Tjsfls, Caco2, Lu JJCJM6Q 
HMEC (total RNA), HMEC, Ffcrohlasts 2 

UlAD Jt23, HS578Tjcefe, UlAD_358 
HMEC, HS578T_C8fc, FtorobbSts2 
EB.cefls, HMEC (total RNA), HMEC 
Fibroblasts 2, EB.cefls, PRSCjcon 

LNCaP.cefls, MB-MDA453, W£C 
RbroWasb2 A549_cefls, 293T_cefls 

uladjc3, mcc uacjm) 

EB cells, Caco2, MCF7 
OVCAfLcete, BT474_ceJJs, Caco2 
HMEC (totei RNA), EBLcefe, HMEC 
HS578T_ce&, HMEC, OVCARjslfe 
HMEC, HS578T_pbDs, Caco2 
OVCAR_ceto, 293T_ceb, HMEC (total RNA) 
m_SCLH52D, LlAD_H23, UlSC_H69 
MB-MDA453. 293Tj»Ss, Caco2 
HMEC, HMEC (total RNA), S.cefls 
EBjceils, HS578I_cefis, Lu_SCJi69 
EB.cefls, HMEC (total RNA), HMEC 
RbroWasts^ Caco2, EB.celb 
HS578T^ceJls, Fibroblasts 2, UUDJW 
HMEC, HS578T_cefls, MB231 - ceDs 
Fibroblasts 2, PRSCjcon, PRSCJog 
EB.cefls, HMEC. Lu_AD_H23 
EBjcelb, A549_ceBs. HS57BTjDeUs 
HMEC. HMEC (total RNA), EB.cefls 
M8-MDA-4355, HS579T_ceIls, HWEC 
LNCaPjcefis, OVCAfLceHs, PC3_cefls 
Caco2, LNCaP.celb, CAUJB.ceBs 
HMEC, HMEC (total RNA), EB.cefls 
HS578T.ce&, HMEC, PRSC jon 
HS578T_cefls, OVCAR_ceJls, LilWLH460 
Caco2. M&MDA453, HT29.cels 
LNCaP.cefls, 293T cells, EB cefls 
HMEC (total RNA). HMEC. HT29_ceSs 
HMEC (total RNA), HMEC. EB.cefls 

HS578Tjcefe. Lu.^Jt69, HT29_cefls 

Lu _ATLH23, HMEC (total RNA), BT474_cefls _~- 

HMEC, HMEC (total RNA), MB231_ce& 

HMEC (total RNA), EB.cefc, HMEC 

LilAD_H23, HMEC (total RNA), HMEC 

HS578T_ceDs, HMEC, LuJjCJM60 

HMEC, HMEC (total RNA), EELceQs 

MB231_cefls, HMEC (total RNA), HMEC 

252 HMEC, HS^ar.cefc, MB231_ceCs 

Caco2, PRSCjcon, UlCaP.ceQs 
HMEC (total RNA), HS578T.ce8s, HMEC . 
HMEC (total RNA), HMEC, MB-MDA-435s 
Fibroblasts 2, HS57flTjceDs, MB^DA-435s 
PRSCjcon, EB.ceQs, Lu_AD_358 
293T^cete, LNCaP.cefls, RPWEJ 
Caco2, HS578T.cefls, HMEC 
HMEC Ftoroblasts2. HMEC (total RNA) 
PC3 cefis, HS578T_cefls, HMEC 
HMEC (total RNA). HMEC, OVCAfLcefls 
PRSCJog, PRSC.con, HS578r_ce0s 
HMEC. HMEC (total RNA), EB.cefls 
EB.cefls, FBHOblasts2, HS578T_cefls 
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115540 AA349954 Ks5S281 ESTs; WeaHy sinter to ASB-1 protein [H 229 
101622 M5621 H&151513 manrasyj {alpte-l^-WycopD^n beto- 2JS3 
103535 Y1362D fe122607 &«flCLLrtymphDina9 228 
127482 AD37294 Hs.155014 ESTs 228 
104297 D31111 Hs.106005 ESTs; Hghjy arnfer to anfige 227 

112318 R55470 Hs.11067 ESTs 227 
101877 M97496 H&778 guanyto cyclase acfivatar1B(re&^ 227 

100760 HG3576 Major Histocornpa&Sy Cmnptex, Ctass 226 

102362 U39412 Hs.7332 N^ftyte^nWe-sereffivB factor attach 226 
106142 AA424590 Hs23S631 Gofei transport complex protein (90kDa) 226 

101461 M22430 Ks.76422 ptospho«pase A2; group 11A (pfertefeis; 226 

1193% T55340 Hs208238 ESTs 226 

127619 AA627122 Hs.163787 ESTs 225 

104113 AA427510 Hs.181202 ESTs; WeaWysim&rtoWisarftAfiinch 225 

131219 C00476 Hs243S5 smal inducible cytokine subfamSy B (Cy 225 

118915 N91481 Hs£4713 ESTs 225 

127556 AA57S831 Hs.190228 ESTs 224 

128700 U59286 Hsl103M2 ^indudbtBcyto^aibfem3yB(Cy 224 

113674 T96374 K&5753 lnosrtol{rnyo)-1 (or 4)nnmophospnabse 2 224 

133085 M73720 Hs.646 can^pej«daseA3(mastcea) 224 

106017 AA411882 Hs26268 ESTs 224 

100582 HG2348 PepfirJeYy 224 

134811 N66357 HsJ3761 ATP synffiase; H+ transporting; rrctochond 223 

102543 U57627 Hs234776 oatofebftXBral syndrome (rf Lowe 223 
127357 AA452788 zx39g11.r1 Scares JoteLMis_Nb2HF8_9w 

IMAG£:788900 5\ mRNA seq. 223 

135288 AA4Q2930 HsS7876 ESTs 223 

105581 AA27885D Hs28891 ESTs; Weakly sfcnflar to 1111 ALU SUBFAM1 223 
103812 AA137107 Hs.124094 ESTs; WeaHy shnQar to NFAT1-A [Mnuscu 223 

117016 HB7171 HsJ2170 ESTs 222 

114607 AA079342 Hs.129057 breast rarancrna arnpffied seq 1 222 

134000 U29091 Hs.7833 selenium binding protein 1 222 

111069 N58461 Ks22036 ESTs 222 

129048 L27670 Hs. 108227 [ntercdliilar adhesion moiecula 4; Lanos 221 

124995 T52700 Hs.110044 ESTs 22 

116678 FD5063 Hs251736 ESTs 22 

118222 N62263 Hs.48501 EST 22 

127888 A1149662 Hs.143590 ESTs 2.19 

113790 W33178 Hs26912 ESTs 2.19 
100097 AF002224 H sapiens Angelman Syndrome Gene, E6-AP 

from promoter P1.5UTR 2.19 

109151 AA17680O Hs.73452 ESTs 2.19 

135368 AA086057 Hs3964 noosoma) protein; fTtohondnat S12 2.19 

109016 AA156936 Hs58069 ESTs; tygHysbnfar to type HcAMP-dep Z19 

124300 H92575 Hs.105959 ESTs; WeaHy simfer to fill ALU SUBf AMI 2.18 

123450 AA598913 Hs.111207 ESTs 2.18 

117435 N27628 yw50b08.s1 Weizmann Ofectoiy EpitheHum 2.18 

119860 mam Hssms ests us 

123833 AA620717 Hs.112889 ESTs Z18 

107938 AA029446 H&53115 ESTs 2.17 

119380 TB3659 Hs.184407 ESTs 2.16 

114066 Z38152 Hs26920 ESTs 2.15 

128748 T59001 Hs.10475 ESTs 2.15 

130414 M21121 Ks241392 smafl Induct QftokBi8A5(RANTES) 2.15 

123490 AA599723 TAP binding protein (tapasto) 2.15 

112588 R773Q2 Hs20226 ESTs 2.14 

110548 H58715 Hs.14706 ESTs 2.14 

101581 M34996 Hs.198253 n^Wstocompaa^con^^dass 2.14 

115248 AA278887 Hs.194530 ESTs; WeaWy sJmSar to unknown [H^apie Z14 

105619 AA280810 Hs24003 ESTs; Moderately similar to LEYDIG CELL 2.14 

128058 A1126817 Hs.132449 ESTs 2.14 
134573 AA442125 Hs.171873 ESTs; WeaHy srmferto PUTATIVE STEROID2.14 

134863 AA353903 Hs.183373 ATX1 (antioxidant protein 1; yeast) homo 2.14 

128811 H17317 Hs.169100 ESTs; WeaHy similar to HPBRIk7 protein Z13 

112368 R59371 Ks26653 EST 2.13 
108335 AA075144 zm86f&s1 Stratagene ovarian cancer (^3 

gbOC1664 TRANSLATIONALLY CONTROLLED TUM 

129611 D45680 Hs.11614 ESTs 2.13 

101253 L34355 Hs39931 sarcogtycan; alpha (50kD oystrophin-asso - Z12 

126701 AA515212 Hs2Q2590 ESTs; WeaHy simfer to mucin glycoprote Z12 

111628 R15825 Hs.4014 WAA0946 protein; Hurrfingfin irrterac&tg 2.12 

108675 AA115240 H&61816 ESTs 2.12 

127131 Z44658 Hs.105460 DKFZP564O0823 protein 2.12 

109990 F02465 Hs27281 ESTs 2.12 

116539 D12124 Hs242890 EST 2.12 

112117 R45402 Ks23789 ESTs 2.12 



Rbrottasts2, BT474_cefls. MB231_ce2s 
PRSCjcon, RPW£_2, PRSCJog 
ULSCJ169, LuJU)J58, UlADJC3 
HS578T_cefe, 293Tjse&s t CALU6 v cefls 
ffi_cefa, DU145^ceb, KT29_ce& 
M&WDA453, LNCaP.cefis, OVCAR_cefls 
KT29_ceSs, BT474jcefls, Caoo2 
FV&WDA435S, MB231_cefe, BT474__cefc 
LNCaP joeQs, MB-MDA-453, Ceco2 
HMEQ, HS57HTjDeBs, Caco2 
LNCaP_ceDs, BT474__cefe, Caco2 
HS57BT..cefls, EBjcelS. HMEC 
IolSQ_H52D, UUJC_H46D, Lu_SC_HB9 
MB-MDA-4353, F2jrobIasts2, HMEC (total RNA) 
ULSCLH520, BT474_ce0s, FtoobIasts2 
HMEC (total RNA), HMEC, MCF7 
HS578T^cefls, S_cefe, HMEC 
HMEC, HS575T,ce2s, Rbrobtasts2 
A549_ceDs t DU145_ceDs, LuJUL358 
HS578T_ceQs, Hbrobtosts2, HT29_cefe 
MBMWM53, 0VCAfl_C8Cs, 293T.cefls 
HMEC, HS57OT_cefe, HfvEC (total RNA) 
Lu_SCLH520. IWCaP^cefls, LitAD_H23 
293TjceQs, EB.cefe, LNCaPjcefe 

HS578T_cefe, RPW^JJ. HMEC (total RNA) 
HS578TJ53&, 293T^cefls, OVCAfLcefls 
BT474^ceib, BT474JSB3. MK31 cefis __ 
Lu_SCJC45, LILAPJC3, PRSCjcon 
Rbrobtosts^ LuJ_C_H460, HMEC (total RNA) 
BT474_ce!ls, Hra.ceHs, HT3 cefis 
LNCaP^ceHs, MB^DA-453, BT474_ce0s 
HMEC, UJ5CJ045. HS578T__cefls 
LILADJC3, HS578T_cefls, UlSCLH520 
Caco2, r»MDAr453, HTBjoefe 
HS578T_cete, BT474_cefe, 293T_ceD3 
HS578T_ce2s, BT474_cefls, M8231^cefls 
BT474_C8Bs. CAUI6_cefls, MK31_ce2s 
HfvEC, HMEC (total RNA), Fibroblasts 2 

HS578T_ceDs, CALU6_ceBs, 293T ceils 
CALU6.ceJis. LilAD_H23, LilSC J«9 
OVCAR_ceBs. A549_ceSs, UlAD_H23 
HS578T_ceQs, BT474_cefc 1 A549_ceils 
LlLAD.358, LiuSCJ<69, Lu.SC _H345 
HfVEC (total RNA), HMEC. MB^DA435s 
LNCaP.ceQs, DU145_cefls, U.SQJi520 
HS578T_ceDs, MB231_cefis, Caco2 
LiLADJ^ Lu_SCLH520, UtAD_35B 
Caco2, 293Tjoell5, 293TjDeQs 
Lu_AD_H23, Ui lAD_358. PRSC.con 
H^C (total RNA), HMEC, EBjcefls 
HrVEC, HT29.cefls, MB23Lcels 
HS578T_C80s, PC3_ce0s, A549_ceDs 
HS578T^ceDs, EBLceDs, IOLSCJI69 
HKEC (total RNA), HMEC, Rbrobbs1s2 
HMEC, HrVEC (total RNA), HT29_cefis 
MB-MDA435S, HMEC, Hfv£C 
HT29_ceDs, BT474_ceSs, CALUe^ceDs 
Lu_SCLH520, ft»M)A435s, LNCaP.ceDs 
HS578I_ceD3, EB.ceDs, HMEC (total RNA) 
EB.ceBs, MB231^cefls > Caoo2 
Lu_SC_H345, HT29_cefis. BT474_cells 
Caco2. Lu_SC_H345, EB^ceOs 
HMEC, HMEC (total RNA), U1JSCLH520 

2.13 HMEC (total RN^, HMEC, OVCARjsfls 
HMEC, HS578T_cefls, Caco2 
HS578T,ceDs. OVCAFLcefls, CALUoLceDs 
EB^cefe, Lu_AD_H23, LilADJQ3 
A549_ce5s, 87474.06113, MB-MDA-4353 
LuJ^DJC3, M&WDA-453, PRSC con 
EB_ce8s, liL.SCJi69, LO.SCJI69 
HMEC, HS578Lcefls, HMEC (total RNA) 
ULADJC3, Caco2, BT474_ceQs 
EBLceDs, Lu^APJi23, UiJSQJ{520 
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126367 AA477929 H&25584 ESTs 2.12 

135252 UB2966 Hs.97207 solute carrier family 28 (sodkinvcoupted 211 

117565 N34301 Hs24B426 EST 2.11 

129430 AA258842 Hs. 197877 H sapiens dona 23777 putefive transmemb 2.11 

120256 AA169801 sema domah; ImnuinogicMln domain (Ig); 111 

134169 D20342 Hs.178137 transducer of ERBB2; 1 (TOB1) 2.11 

130397 AA487452 Hs, 155344 DMA fragmeniafion factor; 45 kD; afcha s 211 

132859 D20925 H&5842 ESTs 2.11 

117633 N36404 Hs.44807 ESTs 2.11 

125003 T59442 Hs, 100445 ESTs 2.11 

125329 AA825437 H&58875 ESTs 211 

114065 238149 Hs.134015 imxiyl 2-sdfotransfefase 2.11 

120718 AA292747 Hs.97296 ESTs 2.11 

133869 T49444 Hs.77031 Sp2 transcription factor 21 

135351 AA430179 Hs.9933 putaflve Aofflte transpose 21 

110973 N51529 Hs.1 18047 ESTs 209 

131879 AA017161 Hs33792 ESTs 209 

116656 F03935 H&241640 EST 209 
120311 AA194074 Hs.1 93401 ESTs 209 
108024 AA040433 Hs.61898 DKF2P586N2124 protein 209 
105871 AA399633 Ha24872 ESTs 209 
120206 Z40805 Hs.91668 ESTs 209 
112333 R56222 Hs26514 ESTs 209 
116746 H04811 Hs.79027 ESTs 208 
121529 AA412257 Hs38121 ESTs 208 
105592 AA279337 Hs.180549 ESTs; Highly sfmter to R26660 J; parte 208 
108582 AA088231 Hs.91732 ESTs 208 
123197 AA489250 Hs.59403 serine pa^ftoyftransfefass; subunit n 208 
134965 J05480 Hs.92 protein phosphatase 3 0omwV2B);cat 208 
123856 AA620814 Hs.144959 ESTs 208 
132058 AA251737 Hs.172818 Apg12 (autophagy 12; S. cerevtefee^lke 207 
126478 R94666 Hs.195155 ESTs; WeaUysfrnDar to transporter prot 207 
106087 AA418740 Hs21111 ESTs 207 
103802 AA122003 Hs.62954 femfin; heavy polypeptide 1 207 
125633 AA908225 Hs.1 26841 ESTs 207 
112817 R98491 Hs.14584 ESTs 207 
111050 N56984 Hs.74335 heat shock 90kD protein 1; beta 207 
133072 AA425294 Hs.64322 ESTs; WeaWy slmSar to Closely related 207 
118270 N62668 Hs.48653 ESTs 2.07 
105035 AA128486 Hs.8859 ESTs 207 
102337 U36922 Human fork head domain protein (FKHR) mR 2.07 
109687 F09380 Hs, 182859 Ifeguard 206 
109802 F10789 Hs.1 2439 ESTs 206 
128103 AA905960 Hs.48516 ESTs 208 
128278 A1018343 Hs.131275 ESTs 206 
131873 H39997 H333716 ESTs 206 
122683 AA455528 Hs36772 ESTs 205 
128066 AA884B38 Hs.189171 ESTs 205 
131451 N25028 Hs26968 H sapiens mRNA from chromosome 5q21-22; 205 
120887 AA365644 Hs.97043 ESTs 205 
103966 AA303166 Hs.1 27270 ESTs 205 
105861 AA399260 H&28454 ESTs 205 
104627 AA001976 Hs.19603 ESTs 205 
108794 AA129468 Hs203392 ESTs 204 
111896 R38936 H&24894 H sapiens done 25248 mRNA seq 204 
101849 M94167 Hs.172816 neuregulinl 204 
119913 W85931 Ha5S785 ESTs 204 
130785 AA242826 Hs.19405 caspase recmfmient domain 4 204 
124702 R06984 Hs.7745 ESTs; Weakly similar to TESTIS-SPECIFIC 203 
106769 AA478001 Hs.225935 oTacylgfycefoi O-acyitransferase (mouse) 203 
132219 N48682 Hs.172971 ESTs 203 
122033 AA431334 Hs.1 09297 ESTs 203 
120461 AA251301 zs10b02.s1 NCLCGAPJ3CB1 H sapiens cDNA 

contains A)u repetitive etement;, mRNA 203 

134959 U90550 H&91813 butyrophEn; subfemfly 2; member A2 203 

104909 AA055892 Hs.14543 ESTs 203 

101950 S79219 Hs£0741 rjrcfitonyi Coenzyme A ca^ 203 

133878 D78947 Hs.7718 ESTs; WeaWy simSar to weak similarity 202 

103459 X99894 Hs.32938 insulin promoter factor 1; rraneodomaln t 202 

125507 AM36377 Hs^58590 tetraspanin TM4-B 202 

116657 F04014 Hs.65996 ESTs 201 
112920 T10234 Hs.4275 ESTs 201 
105533 AA258572 Hs.6418 ESTs; Moderately similar to seven transm 201 
126762 AA064671 zm13b04.r1 Stratagene pancreas (#937208) 

similar to TR-G413842 G413842 NONCLASSI 

128999 R37808 Hs.1 07765 ESTs 201 



Lu_SC_H69, ULADji23. ULADJ58 
MB-MDA-435s, 293T_ceIts I CAUJ6jceBs 
HMEC. HS578Tj*0s, MB231_ceDs 
HS57BT_cefi3, LlAD_358, MB-MDA-435s 
HMEC, HMEC (tofel RNA), EBj»Ds 
HMEC (total RNA), 293T_cefc, OVCAFLceDs 
293T_ce&, Caco2, Lii_ADJ*23 
HMEC (total RNA), Rbrobfasts2, HMEC 
HMEC, Caco2, HS578T_cefls 
MB-MDA-4353, HMEC (total RNA), HT29_ceIIs 
HS578T_cete, PRSC_con, PRSCJog 
MB-MDA435s, 293T.cefls, PRSCjan 
HT29.ceBs. LuJU)_H23, Lu_SC_HS9 
Lu_LCJ4460, LLADJ58, RPWEJ 
HS57aT_ceB3, EBLcefis, HMEC 
EBjoeDs, HS578T_ceSs, MCF7 
HMEC (total RNA), MB231_ceBs. BT474_celb 
HS578T_ceDs, UOjCJW®. U_SC_H69 
OVCARjcefis, HMEC (total RNA), HMEC 
HMEC (total RNA), BT474_ceBs. HT29_cefls 
Fibroblasts 2 A549_ce2s, HS578T_cefis 
BT474_ce0s, MB-MDA-453, EB_cells 
ULADJC3, Fibroblasts^ ULMLH460 
MB-MDA435S, HMEC (total RNA), Lu_SC_H345 
HMEC, HMEC (total RNA), HS578T_ceQs 
LNCaP^cens, PRSCJog, PRSCJog 
HS578T_ceBs, Lu_SCJ*345. Lu_SCJ«9 
EB.cefis, U.SCJi69 r U|_SCJ1345 
LNCaP.cels, MB-MDM355, HMEC 
HS578T_ce!ls, BT474_ceft3, BT474_ce83 
HS578t,ceb, MCF7. HMEC 
PRSCJog, UUJCJW60, RPW^J2 
OVCARjceBs, A549_ceHs, UlAD JC3 
HMEC, HMEC (total RNA), HS578T_cefls 
EB.cefis, Ftocbfast32 Lu__SCJ<69 
HMEC, HMEC (total RNA), Rxobbsts2 
LNCaPjoeOs, DU145_ceIIs. 293T_cefls 
LNCaP.ceBs, M&MDA-453, Caco2 
HMEC (total RNA), HMEC, ESLceKs 
LWCaP^ceBs, PC3_ce0s, EBjoeDs 
293T_cefis, HMEC, HT29_ceils 
BT474_cells, BT474_cels, LuJVD_H23 
EB cells, EB cells, Caco2 
HT^LceQs, HMEC (total RNA), HMEC 
PRSC_con, U_SC_H345, HS57ST_celte 
HMEC (total RNA), HMEC, EB.ceQs 
LNCaP.ceb, UiJULH23, HS578T_cete 
HMEC, HMEC (total RNA), Fibroblasts 2 
MB-M3A-435S, UOCJ{460, U_SQ_H520 
HS578T_ceBs, PRSC.oon. HMEC 
HMEC (total RNAX UTCaP.cefls, PC3.cete 
Fa»obIasts2, HMEC (total RNA), EB_ceOs 
HS57BT_cete, HMEC, BT474_ceHs 
HS578T_ce0s. HMEC, A549^ce& 
HS578T_ceDs, PC3josb, 293Tjcete 
HMEC, HS578T_cete, HMEC (total RNA) 
HMEC, BT474_cete, MB231_ceils 
HMEC HS578T_ce&, BT474_ce2s 
RbrobIasts2 PRSC.con, HMEC 
PC3_ce&. EB_ceBs, HS57BT_ceBs 
HT29„ceDs» PC3_cefe, A549i.ce& 
OVCAFL.C8&, A549joeQs, Caco2 

HS578T_ceQs, EBLcefis, EBjoeCs 
HMEC. Fibroblasts 2, EB.ceRs 
Lu_SCJ^345, PC3jDefis, DU145_ceb 
ULSCJ169, EB.cefc, CAUUG^ceils 
EB_cefls, MCF7, MB231_cefls 
EBLcefis, LuJVDJi23, LuJVD_358 .. 
A549_ce0s. U_SCLH520, LilADJC3 
HS578T_cefls, HMEC, MB231_ceCs 
HS578T_ceils, HLceDs, PRSC^con 
HS578Tj»Qs, HMEC, EBjceDs 

201 RPWU UiJDjUZ, Lu^Dj35B 
HS576T„ce!Is, OVCARjceBs, EB^ceb 
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Table 2 







Pkey: 


Umqne Eos probeaet identifier panto 










ExAccn: Exemplar Accession number, Gcnbank aco 


cflfijop mi inner 








UrugenelD: Unigene number 










Umgene Title: Unigene gene title 






Pkey 


ExAccn 




i#oinpt0u)_i iiio 


Ratio Metsi 


BS Top 3 expressing cell Ones 


101447 


M21305 




n Lunan apjT3 saijpmp ana saisiiuB o ju 


110.98 


EBjcete, fibroblasts 2, A549 ceBs 


105039 


AA1 30349 


He. 


ESTs 


9.13 


sTceDa OVCARceUa Lu SC H345 


106034 


AA419461 


Up 40407 


ESTs 


&51 


HT29 ceDa MB-MDA453. HS578T cells 


105777 


AA348412 


nSuuuSQ 


EST3 


8.4 


293T cela OVCAR cella EB cells 


129818 


1454841 


Lin 4 FtL. ft 

MS. 172572 


ESTs 


12 


Lu SC H69 EB cela Lu SC H345 


118475 


K66845 


ll_ 4CCA44 

Ha 165411 


ESTs; Weakly slmfar to Ml ALU CLASS B 


7 


DU145_cefe. EB cella Caco2 


112170 


R48744 


Ha192B7o 


ESTs 


6^1 


293T cela DU145 ceSa HT29 cells 


114918 


AA236813 


Ha/2324 


ESTs; Highly similar to unknown [H^apte 


6.6 


EB crib. 293T ceBa DU145 eels 


104590 


R79750 


rHLOJOZJ 


nudear receptor subfemBy 1; group 1; m 


&58 


oorr cote OVCAR cefe. HMFC 


120625 


AA285Q53 


Lin 4 1*174 CO 


ESTs 


6.55 


CALUB cete OVCAR cefls. EB cells 


115650 


AA404564 


LL. ATftTlA 


ESTs 


6.43 


co relh LNCaP ceHs. Lu SC H345 


124568 


N 67086 


Lio 4fw>nn 


ESTs 


635 


PC3 cella A549 cella DU145 cells 


134238 


R81509 


Lin 4 0iC74 


spSdng factor; aig^iiefeerine-rich 1 1 


632 


293T_cefla Lu_SCJ^345, HMEC 


114721 


AA1 31450 


tin 4IVJO*lO 

Hs.103822 


ESTs 


6.13 


CacoZ MB-MDA-435a PRSC kn 


106145 


M424791 


Hs.5734 


K1AA0879 protein 


6 


ryVCAR cells. EB eels. 293TT cete 


114610 


AA081079 




zn32h9n?1 Stratagene endo&iefial cefl 93 












lMAGE5491853',mRNAseq 


557 


PRSC con DU145 cete HS578T cells 




R12777 


1 1 4 cone 

Ha 15395 


ESTs; Weakly snnferto ARGINYL-TRNA SYN 


c Q4 PRSC con HTM cells. EB rpJh 


124690. 


Rflfiftlft 


HS. 1/3630 


ESTs 


5.92 


LNCaP ceh PR cells. OVCAR cete 


113490 


T88700 


Ha173374 


ESTs 


5.81 


DI 114*5 cete. PC3 cete. HMFC (total RNA1 


104425 


H88496 


Ha 40583 


ESTs 


5.77 


OVCAR cete HS578T cete A549 cete 


118828 




H&9U824 


EST 


5.45 


LNCaP cete OVCAR cete. DU145 cete 


129076 


AA262179 


Un 4CM11 

H3. 153343 


ESTs 


535 


293T cella BT474 cete. MCF7 


109684 




K&140885 


ESTs; WeaHy similar to UNE-1 REVERSE T 524 


FQunblas(s2. Lu SC Hfi9 011145 mte 




IwOvfO 


Ha8B959 


Human DMA seq from clone 967N21 on chr 2 










part of WAA0172; the gene for a novel 


532 


FR cete. PC3 cete. Lu SC H345 


109032 


AA158234 


Ha/2222 


ESTs 


5J23 


HT59 cete. PCS cete Lu AD 35fl 


129350 


U50535 


Ha 11 0630 


Human BRCA2 region; mRNA seq CG008 


5\2 


9Q3T cete. EB cells. DU145 cete 


11266? 




Lin im^en 

Ha193150 


ESTs 


5.2 


MR-MDA-A.'Ws PRSC cot MRJuDA-dTC 


132902 


AA490969 


Ha168i47 


ESTs 


5.18 


PC3 cells. LNCaP cete. CALLS cete 


126872 


AA136653 




ESTs 


5.04 


PR cete Fihmhkwts ? AS49 cells 


199528 


AA449804 




EST 


5.04 


1 ti SC H345 PRSC cen LNf^aP cplts 






Ha313 


sstibted phosphoprofein 1 (osteoponfin; 


5.02 


III LC Hlffl. A549 mils. MR-MT)A-d!V« 


121332 


AM04384 


Ha 97921 


ESTs 


5.01 


PR cete. Lu SC Hfi9 DU145 cete 


135357 


AA235803 


Ha/95/2 


cathepski D (lysosomal asparfy) protease 


456 


PR cete MCF7 LUJ145 cete 


109141 


AA176428 


Uh IQTOQfl 

naiauou 


ESTs 


436 


DLI145 cete PC3 cete PRSC too 




AA082041 


Lin oom 


ESTs 


433 


PR cete Lu SC H345. HS578T celb 


124875 


R70506 


Ha207K£3 


ESTs; WeaWy stater to IID ALU SUBFAMI 


4.75 


DU145 cete OVCAR cete. LNCaP cete 

UU ltw_U3*n?| VlwrV\.\nHW| U1var_WIO 


102380 


U40434 


rtai3K»i 


mesofhelh 


4.71 


OVCAR cete Lu AD H23. RPWE 2 


127956 


AA826117 


naiywio 


ESTs 


4.69 


pr cete, HS578TjcefiSp DU145_ceSs 


125038 


T7B089 


ru*. IDOOOr 


ESTs 


43 


OVCAR cete 293T cells. DU145 cete 


102515 


U52696 




Humn adrenal Creb-rp hmjg (Creb-rp), com 


4.57 


Lu SC H345 Lu SC H69 HT29 cete 


109027 


M157818 


flit ivftuW 


Human endogenous retroviral protease mRN 


437 


PC3_cefla EBjceBs, Lu_SQ_H520 


115096 


AA255991 


HS.17531Q 


ESTs 


437 


OVCAR ceils. 293T ceOa PC3 cete 


123470 


AA599106 


We 1 0A 7AA 


ESTs 


435 


LKCaP.cefis, Lu^SCJ^ra, 293T_cete 


113219 


T59257 


no. urftu/ 


ESTs 


435 


A549 cete. 293T cete 293T ceils 


123433 


AA598661 


Ue 119J7R 


ESTs 


435 


PR rete. OVCAR cella HT29 cete 


135162 


M28170 


He Qfift91 


C019 antigen 


433 


OVCAR.cels, 00145.06115, EBjceHs 


121721 


AA419470 


Ha199961 


ESTs 


431 


DU145_cete, LNCaP.cefls, EB.cete 


129126 


H88486 


Ha108806 


ESTs 


4.45 


LNCaP.ceQa Caco2,"EB_cete 


135232 


AA342457 


Ha96800 


ESTs; Moderately stmfiar to till ALU SUB 


4.43 


LNCaP cete, DU145 cete, OVCAR.cete 


124847 


R60044 


Ha106706 


ESTs; Highly similar to BETA-CATEN1N (H. 


4.42 


OVCAfCceDs, CALU6_celIs, CALU6 eels 


110349 


H40988 




ESTs; Weakly similar to IIII ALU SUBFAMI 


439 


DU^^cete, OVCWLcete, LNCaP.ceHs 


134402 


U25165 


Ha82712 


fragile X mental retardation; autosomal 


438 


HS578L-Ce0a OVCAFLceils, DU145j»te 


115494 


AA290603 


Ha256517 


ESTs 


436 


Lu.SC JI345. OVCAILceDs. PC3.cete 


119174 


R71234 




yi54o0&s1 Soares placenta Nb2HP H sapie 










transcript (rRNA); girS41458 ROD CGMP- 




DU145.cete, OVCARj»lls, LNCaP.cefa 








BETA-SUBUNfT (HUMAN);contam 


433 


121943 


AA429265 


Ha126759 


ESTs 


43 


EB_celis. HT29.ceBs, Lu_SC_H69 


110856 


N33053 


H&23291 


ESTs; WeaJdy sfenBar to S164 [Rsapfens] 


428 


0VCAR_ceil3, EBjste, LiLSC_H69 


102474 


U49973 




Human Tggerl transposable element, camp 


428 


DU145.cete, LNCaP.cete, OVCAR^ceils 


123458 


AA598963 


Ha1 12499 


WAA0512proteki 


427 


A549_ce&, A549_cete. BT474j»Ds 


116459 


AA621399 


Ha64193 


ESTs 


422 


Caco2, HS578T_ceIls, MB-MDA435s 


126301 


N62371 


Ha100043 


ESTa-WeaWysknlartoSmtotocutic 


422 


PC3_cete, DU145_cefla U.SCJW5 


123461 


AA598990 


Hs251119 


EST 


422 


Lu^SCJi345, L1LSCJI69. OVCAR.cete 
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130588 AA2S7735 Hs.16411 Hunan DNAseq tan cta»1189B24 on chro 



MLRQadwnapc 1333; EC 13393; 
Tyrosmeisrotein Kinase FER (EC 2J.1.1 42 
125755 W25498 Hs31634 ATP synthase; H+ transporting; n fa tond 42 
135009 AAO405Q7 HS251865 ESTs 4.19 
107001 AA5S3589 Hs24492 ESTs 4.18 
124896 R82063 Hs.101594 EST 4.16 
119404 TB295D ye27c10.s1 Stratagene tung (#937210) H 5 4.15 

125090 T91518 ye20a>5^1 Straiagw tug (#837210) Hs 

contains Akj repeflSva eteniaiLcorrtah 4.14 
117348 N24157 Hs.139615 ESTs 4.1 
111389 N95837 Hs.169111 ESTs; WeaWy similar to L82A [Dan^anoga 4.1 
134977 AA46469B Hs.19390 ESTs; WeaWy slm&rto bufious perrtftfgo 4.09 
124698 R0S273 Hs.186467 ESTs; Moderately sinter to HI! ALU SUB 4.09 
124090 H0957D Hs.143032 ESTs; Weakly sMar to rieurorcrf thread 338 
133992 R46354 Hs.169832 zinc finger protein 42 (myeta&specffic 338 
126009 H51652 Hs242985 hemoglobin; gamma G 336 
114161 238904 Hs22385 ESTs; WeaMysimiar to K1AA0970 protein 334 
109171 AA180356 HsJ3700 EST 334 
122)07 AA430629 Hs38564 ESTs 333 
131936 AA094865 Hs.179972 krierferon; alpha^ndudbte prufein (do 33 
128668 AA194849 Hs.103422 ESTs 33 
124977 T33859 Ks.190452 WAA0365 gene product 339 
107048 AA600012 Hs.10669 ESTs; Moderately slrnlbr to WAAG400 [H. 339 
105358 AA236034 H&25362 ESTs 339 
135106 AA599037 Hs.9456 SW1/SNF refated; matte assccd; ac&i de 336 
106686 AA463215 Ks29896 ESTs; WeaMy sbnferto profee-rich pro 335 
132093 AA400091 Hs39421 ESTs 335 
128651 AA446990 Hs.103135 ESTs 334 
102459 U48936 Hunianairobnc^er^v^efS^sod 334 

113732 T982B8 Hs.193295 ESTs; WeaWy similar to UD ALU SUBFAMI 332 

1160G0 AA448710 Hs.41327 ESTs 332 

120748 AA303153 Hs237994 EST; WeaWy sirnBar to ALU SUBFAML 332 

116318 AA490830 Hs38570 deteted in cancer 1; RNA he&ase HDB/DI 3.79 

1143® Z41747 Hs.469 sticcmateitehydrcgenasecar^ 378 

107248 D59894 Hs34782 ESTs 3.75 

132713 AA286906 Hs35335 ESTs 3.75 

102222 U24683 Hs.159386 Immunoglobulin heavy variable 44 3.73 

108201 AA057518 Hs.63394 ESTs 3.72 

119940 W86779 Hs, 171 807 DKFZP586B0319 protein 3.71 

106503 AA452590 Hs30348 ESTs 3.67 

114360 Z41592 Hs22129 hypothetical protein 337 

100991 J03764 Hs32085 piasniinogen activator inhfcrton type I 337 

107580 AA002091 Hs.175476 ESTs; WeaWy similar to llfl ALU SUS^AM 337 

111685 R21408 Hs.106095 ESTs 336 

128336 A1242720 Hs. 146043 ESTs; WeaWy similar to atemafiveJy sp 3.66 

13)868 AA004900 Hs. 171 91 7 ESTs; WeaWy smk to smfrto glycerophos 3.61 

116802 H44061 Hs.194026 ESTs 3.6 

130753 Z46632 Hs.189 phosphodiesterase 4C; cAMP^peciSc {dun 33 

123074 AA485117 Hs.105653 ESTs 3.6 

114317 Z41038 Hs.469 succinate oerrydrogenase comptex; subunit 3.6 

134194 AA233231 Hs.79828 ESTs 339 

127752 AA8Q8388 Hs211167 ESTs 339 

123526 AA6Q8657 ESTs; Moderately sbnter to HI ALU SUB 339 

127917 AA211895 Hs.118831 EST; HJgrty sirdar to dJ1163J1 .2.1 [Hi* 338 

105941 AA404427 Hs.10669 ESTs;ModeraterysImi}artoKlAA0400[H. 338 

124694 R05108 Hs.135258 ESTs 336 

105655 AA282571 H$2Q3772 FSHD region gene 1 336 

111168 N66951 Hs238380 Human endogenous retroviral protease mRN 335 

133254 AA156670 Hs.180780 H sapssns agrin precursor mRNA;parfial 334 

132640 U33821 Taxi (hurr^T^leijJcernlavhJsrypel 333 

116562 D25807 Hs30145 ESTs 332 

126045 N80361 Hs.14248 ESTs 331 

122878 AA465341 Hs.99640 ESTs 3.47 

105220 AA210695 Hs.17212 ESTs 3.47 

127001 AA731636 Hs39319 ESTs; WeaWy simflar to HQ ALU SUBFAM 3.45 

112693 R88741 Hs310S5 ESTs; Moderate^ sfmliarto praSerafio 3.44 

104935 AAD83280 Hs35552 ESTs 3.43 

128710 J04813 Hs.104117 cytochrome P450; subfamily DIA (n^hedl 141 

1319SB D86956 Hs36927 heat shock 105kD 3.4 

119229 T03229 H sapiens (clone 104) retinoblastoma 1 g 3.4 

128046 AA873285 Hs.137947 ESTs 3.39 

105175 AA186804 Hs25740 ESTs; WeaWy similar to ub^fiausTPRm 339 

132349 Y00705 Hs.181286 serine protease tnrt&ar^KazaJtype 1 338 

101559 M32053 Human H19RMA gene, complete cos 337 

116389 AA599011 troponin T1;steferat slow 336 



EB_ce*s, LNCaP_cefls, MCF7 
HNEC, S.ceOs, DU145_cdb 
293T_cefis, EBjceBs, DU145_ceQs 
293T cefls, DU145 ceSs, EB ceQs 
OVCAR^ceb, Lu_SC_H345>ltt£C (total RNA) 
DU145_cefe. PC3_cefis, Fforob)asts2 

IflCaPjsOa. DU145_ceSs. OVCAR_ce& 
LilSC JC45^ liLSCjm PRSCJog 
DU145_cefls, MCF7, LNCaP_cefe 
OVCARjceQs, Fibroblasts 2, Lu_SC_H59 
OVCARjaDs. L1LSCJ045. PRSCjan 
DUI^cefia, OVCAfLcefe, LujSC_H345 
HT29_cefc, MB231_cefis, BT474josfe 
Lu_SC JJ69, OVCAR_cefc, K.cefc 
HS578T_cefc, SLcefe, PRSCjsn 
293T_ceUa, MB-MDA-435S, A549.ce0s 
PC3_ce0s. A549_ce0s, OVCARcete 
CALU6jDe&s, EB_ceDs, Lu_SC_H69 
LuJUD_H23. ajDefis* Lu_SCJi69 
293T_ce&, DU145_ceDs, S_ceQs 
PC3^cefe, HS578T„ceD3, DU145_ce2s 
Caco2, EB_cefls, CALU6_cefis 
EB.cefls, LNCaP^cefis, Caco2 
OVCARjceb. DU145_cefis, BBjdbBs 
OVCARjceSs. OVCARjsDs, U^aPjjefe - 
EB_cefe. LNCaP_cete, OVCAf^ceQs 
HT29_cefe, BT474 - cels, LlLSC J«9 
DU145^ceSs, OVTAR_cefls, LNCaP.ceSs 
DU145_cefe, MB-MDA-453, LlSC_H69 
DU145_cete, DU145.ce0s, Lu_SCjm5 
MB-MDA-453, CALU6_oefe, ffi.cels 
DU145jcefe, Fibroblasts 2, Caco2 
LNCaP_ceQs, DU145_cefe l EBjceDs 
OVCAFLceb. EBjceQs, LilSC_H345 
EB_ceBs, OVCARjceDs, 293T_ceto 
293T_cefis, DU145_ce0s, EB^ceb 
EB_ceBs, Caco2. DU145_cefls 
EB_ce2s, LNCaPjsfls. 293T_cete 
mTg.ceUs, Lu_SQ_H520, Lu_SCLH520 
Fibroblasts 2. HS578L.oeJIs, MB231_ce0s 
OVCAR_ceb, LNCaP.cefis, Ui_SC_K345 
OVCARjoeb, A549_cefe, 293Lp3Bs 
Lu_SC JC45, Caco2, OVCAfLcels 
EBLceOs, HS57aT_ceQs, LNCaP ceQs 
Lu_SC_H345, OVCAR^cefls. DU145_cefe 
l_U_SC_H69. LuJVD_H23. LilSC J<345 
293T_ceDs I Mffi31_celis, Fibroblasts 2 
DU145_cefe, KS578T_ce&, CALU6_cefe 
BT474_cefls, MB231_ceCs, HT29_cefe 
Lu_SQ_H520, MB-MDA4353, DU145_cete 
DU145_ceBs, OVCARjste, LNCaPjcells 
Lu_SCJC45, OVCAR^cefe, PRSC.con 
PC3_ce8s, DU145_cefls, HS578T_cefls 
luJVjm, LiLSCLHSO, UlAD_358 
DU145_cefls, Sjsb, A549_cefls 
PCS^ceUs, EBLcefe, MB231 jcete 
OVCAFLcels, DU145_oefis, PC3_cefc 
MB231_ceBs, CALU6_ceDs, BT474_ceQs 
MB231.ceb > BT474_cefis 1 Lu_SC 
DU145_ceSs, U.SC_H345 f OW^Rjcete 
HT29_ceJl3, OVCAR^ceUs, HMEC 
MB-MDA-435S, HT29_cefe, HT29_csfe 
lUCaP.ceJls, DU145_cefls, Lu_SC_H345 
EB_cefls, LNCaP_ceto, DU145_ce0s 
LNCaP.cefls, CALU6_cefe, 293T_cefe 
HT29_ceQs, A549_cefe, FBjroWasfe2 
EB^cefis, PC3jcefe, LuJSC_H345 
DLM45_ceCs f Lu_SCJC45, EBjsQs 
EBLceDs, LNCaP.ceHs, DU145 ceQs 
PC3^cefls, MCF7, DU145jcefls 
Caco2, EBjoeSs, Lu^SC 
Lil.SC Ji69, MCF7, OVCAR.cefe 
DU145_ceb, LUCaP_ceDs, OVCAR_ceUs 



89 



WO 02/068677 



PCT7US02/06001 



130641 AA182001 Hs.17155 ESTs 136 

109362 AA214615 Hs.194348 ESTs 3.33 

106278 AA432292 Hs23388 ESTs; Moderately similar to MI ALU SUB 133 

127241 AA321849 Hs248340 H sapiens mRNA; cDNA DKFZp564J2116 (from 

133339 N64588 Hs.71252 ESTs 132 

113260 T64696 Hs*237992 ESTs 132 

133349 N75791 Hs.7153 L-34rydroxyacyH)oenzyme A dehydrogenase 131 

107149 AA621159 Ha23284 ESTs 129 

133195 AA350744 Hs.1 81409 WAA1007 protein 129 

111302 N73838 Hs.1 5049 ESTs 129 

106414 AA447971 H&28827 ESTs 328 

121768 AA421561 Hs251664 teufn-Oke growth factor 2 (sorratomed 128 

117176 H98670 Hs.49753 ESTs; Weakly similar to hypothetical pro 128 

131320 AA171948 Hs.145698 spGdng factor (CC1 3) 3.2B 

100700 HG3227-H Guanine NudeoiirJe-Bindlng Protein Hsrl 127 

134275 AA132328 Ha3688 ac&Unducibte phosphoprotefn 126 

117667 N39214 Hs.44708 Ser-Thr protein kinase related to the my 126 

124889 R78604 Ha 101 570 ESTs 125 

126631 W95117 Ha 193337 ESTs 125 

105643 AA282069 Hs.173802 WAA0603 gene product 124 

132718 AA056731 Ha554 Sjogren syndrome antigen A2 (60kD; ribon 3-24 

116417 AA609309 Hi239302 ESTs; Weakly similar to Iflf ALU SUBFAMl 3J24 

108039 AA041341 Hs.46670 ESTs 324 

114116 Z38496 Hs.103283 WAA0594 protein 123 

124514 N5B045 Hs.142737 ESTs 122 

110802 N26651 Hs. 252748 ESTs 122 

106920 AA490899 Hs24462 ESTs 122 

123523 AA608588 Hs.193634 ESTs 321 

131564 AA491465 H&26792 ESTs 12 

119423 799544 Hs.173734 ESTs; Weakly simQar to III! ALU CLASS B 12 

128736 F03934 Hs.104607 ESTs 119 

101511 M27826 H&238380 Human endogenous retroviral protease mRN 3.18 

114509 AA043551 Hs.95249 ESTs 116 

124196 H52617 Hs.144167 ESTs 117 

129095 L12350 Hs.108623 rJrrombosponrJin 2 117 

116457 AA621367 Hs.119683 ESTs 3.17 

117040 H89112 yw25e5.s1 Morton Fetal Cochlea H sapiens 116 

129112 N32S21 Hs.108738 ESTs 3.16 

130418 J03242 H&251664 tnsuIMte growth factor 2 (somatomecfl 3.16 

131199 R80048 HsJ234433 ESTs; Weakly similar to transporter prot 115 

110357 H41529 Hs33549 ESTs; Highly similar to sufforryturea rec 3.15 

130068 AA608S03 Ha.106220 K1AA0338 gene product 115 

127423 T47546 Hs.1 19252 tumor protein; transfatealry^ntrofla 3.15 

105028 AA126719 Hs.25282 ESTs 114 

102349 U37547 Hs.75263 apoptoss inhibitor 1 3.14 

105126 AA157814 Hs^6288 ESTs 3.13 

115465 AA286941 Hs.43691 ESTs 112 

133246 AA086452 Hs.68731 triadin 3.12 

122698 AA456112 Hs£9410 ESTs 112 

123553 AA608841 Hs.111977 ESTs 112 

133437 R57419 Hs.7370 ESTs 111 

104956 AA074880 Hs. 120975 ESTs; WeaWy simgarto rrypcfoefcai pro 3.11 

116314 AA490588 Hs.43118 ESTs 111 

120562 AA280Q36 Hs.1 7391 2 eukaryofic translation initiation factor 111 

108770 AA127845 Hs.71027 EST 3.11 

129791 F02778 Hs.173887 KIAA0876 protein 3.1 

115783 AA424487 Hs.72289 ESTs; WeaWy similar to UV-1 protein [H 3.09 

107630 AA007218 Hs.60178 ESTs 107 

124339 H99093 Hs.6179 H sapiens mRNA; cDNA DKFZp566K2322 (from 

122314 AA442257 Ks.192076 ESTs 107 

104589 R79299 H&241160 ESTs; Moderately similar to fill ALU SUB 107 

115687 AA410508 Hs.183765 ESTs; Moderaterysmlr to ORF derived ton 106 

123796 AA6203SO H&247444 ESTs 3.06 

106483 AA451676 HSw30299 IGFJI mRNA^WIng proteln2 3.06 

133318 AA256168 Hs.70838 ESTs 3.05 

117244 N20979 Hs.1757 L1 eel adhesion molecule (rrydrocephaXrs 

thumbs) syndrome; spastic paraplegia 1) 105 

130797 AA430050 Hs.180948 K1AA0729 protein 3.05 

128959 D79791 Hs.107381 ESTs; Weakly simterto F38A5.1 [C.etega 3.05 

120481 AA252703 Hs.191754 ESTs 104 

126649 AA856990 Hs.125058 ESTs 103 

106970 AA504835 Hs^4252 ESTs 3.03 

126488 N34935 Hs25533 ESTs; Highly sim3ar to ARF CTPase-acfiv 103 

119498 W37226 t&55573 ESTs 3.01 

129967 H99653 Hs.138618 ESTs 3.01 

130698 AA037357 Hs.188212 ESTs 3.01 



Dm45_cen3, MB-MDA-435S, HS578TjceBs 
HT29_ce0s, Fibroblasts 2, BT474_ceHs 
EB cete, Fibroblasts 2, BT474_cefls 
132 LNCaP^oete, DU145_ceto, EB_cefc 
Dm 45 cells, EB eels, Caco2 
Ur_SCJB45, LMfcPjceBs, LilSC_H69 
Caco2, EB_ce&, OVCAR_cefe 
HS578Tjcelb, DU145_ceto, PRSC^con 
EB_cefe, LuJ\D_H23 ( LuJVD J58 
DU145_cefe, EBjalls, H$578Tj»Ds 
A549_cefis, OVCAR^cefls, PC3_cefe 
Caca2, PRSC.con, PRSCJog 
PRSCJog, CALU6jDe8s, OVCAR_cefls 
EB cells, LNCaPjcelb, DU145_ceIls 
EB cefls. RPWU LuJVD_H23 
EB eels, DU145jcelJs, LNCaP_ceils 
LNCaP cells, DU145_ceils f MB-MDA-453 
UlAD_K23, UL$C_tf69. UlSC_H345 
LU.SCJ045, OVCARjaBs, Lu_SCJi69 
Caco2, EB cefls, 293T ceOs 
CALU6_cete. OVCARjceiis. A549_cefc 
A549_cens, CALU6_cefc, 293T_cefc 
293Tjcefe. EB.cefc. Caco2 
DU1451cefls. OVCAR_cefe, EB.ceDs 
EB.cefis, CacoZ Lu_SQ_H520 
LNCaP_cefe, MB-MDA-435S, MB-MDA-453 
DU145_cefe, ffi_cefls, OVCAR_ceJIs 
DU145LC6&, LNCaP^ceib, OVCARjceiis ~ 
HS578TjceSs, HMEC (total RNA), HMEC 
EBjcefls, DU145_cefe, Caco2 
PC3_cefls, Lu_SQ_H520, Lu_SCJi69 
PC3jDefls, DU145Lce»s, Lm_SQ_H520 
EBjcefls, LilSC_H345, DU145joefe 
BT474_cefls, MB231_ceils. HMEC 
FIbrobiastsi, PRSCjoon, PRSCJog 
293T_cefls, UlSC_H345, CAUU6_cells 
OVCAFLceils, 293T^cete, EBjsBs 
EB_ce8s, Fa>robfests2, MB231_celb 
Caco2, PRSC.con, PRSCJog 
PC3_oefls, EB_cells. OVCARjceiis 
LiuSCjmS, PRSC.con, LilAD_H23 
OVCAR_ceBs, CALU6^ceils, HS578T_cens 
EBLcefls, PRSC_con, LNCaPjceQs 
LNCaPjceOs, PC3 celts, EB cefls 
DU145jce0s, HSS^T^cete. lNCaP_cefls 
EB.cefis, HS578T_cefe, LNCaP_cefe 
EB_cefis, DU145_ceHs, 293T_cefe 
LilSCLH520, UlADJ<23, PRSC_tog 
DU145_ceDs. OVCARjceiis, A549_cel!s 
EB^ceils, Caco2, DU145_cefe 
HS578T-Cefe, 293T_ceBs, Caco2 
OVCAFLceiis, Fibroblasts 2, Caco2 
EB.cefis, MB-MDA-435S, HT29_cefl3 
LNCaP.cels, DU145_ceils, EB.cefe 
LiL.LC_H460, Lu.SC J<345, LuJUD_358 
Lu_SC_H345, LuSCJi69, PRSCJog 
UlAD_358, EB.cefe, PC3_cels 
Lu_SC_H345. CALU6..ceIls, LiuSCJI69 
3.07 293T_ceDs, MB-MDA-453, Caco2 
293^1^ LNCaP jCefc, PC3_cefis 
293T_ce!ls t DU145_cefls, EB_cefls 
CacoZ EB.cefe, MB231_ce0s 
UjlSCJ(345, LNCaP.cells, DU145_cefls 
OVCAR_celis, HMEC (total RNA), HMEC 
OVCARjceiis, UlCaPjcefe, 293T_cefe 

MB231.ceBs, MCF7, CALU6_ce2s 
EB ceils, DU145_ce8s, DU145_cefe 
LNCaP.cefe, HS578T_ce& l Lu.SQJCT 
EB eels, FBMObtasts2, PRSC_con 
OVCARjCefe, LNCaP.ceHs, 293T.ceQs 
EB cells, OVGAR_ce8s, 293T_cefe 
LlAD_358, MCF7, MB231j»8s 
293T_cefe. HS578T.cefls, CAUi6.ce& 
Ul.SC JB45, UC.SCJH69. PRSCJog 
OVCAR^ceBs, LNCaP.ceOs, DU145jceHs 
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111018 
123196 
133229 
130405 
107881 



105479 
115550 
12)166 
123355 
109546 
12001 
102259 
105583 



114533 
110220 
124917 
127111 
134882 
121788 
128530 
128435 
113782 
1 27569 
109642 

14615 



113947 
129455 
107772 
127159 
I24792 
109751 
128926 
I06637 
132164 



104818 
1 26050 



N54067 K&3628 mfcgefKafoated protein kinase kinase 331 

AA4S9250 Hs^3403 sate pato ffl pyft an s fa asas sufcunS 0 3 

AA203433 Hs3834 WM1014 protein 3 

Hs.155396 nuclear factor (oytoitaerived 2)44 3 

Hs.61273 H sapiens chroraoscrra 19; cosraUR32611 3 

Hs.17132 ESTs 3 

H&23467 ESTs 239 

Hs3Q575 ESTs; Moderatefy similar to Ml ALU SUB 239 

Hs.151411 KIAA0916 protein 238 

Hs.160657 ESTs 238 

1&26954 ESTs 237 

Hs.107812 ESTs;WeaWsfs&nfertoproi»fichpio 237 

Hs32222 sema domain; fcnmunogfat^ domain (Ig); 237 

HsJ24549 ESTs 236 

Hs3337 transmerobia!«4superfajn^ 236 

Hs.177526 ESTs 238 

H&27090 ESTs 235 

Hs.75470 hypoft^calproto; exposed in osteo 235 

Hs-220509 ESTs 234 

Hs3C538 ESTs 234 

Hs,178113 ESTs;^&Kte^ate^ysimi^toknesjn^i3® 234 

Hs.183475 Hsapfens done 25061 mRNAseq 234 

Hs.147112 ESTs 233 

Hs,15342 phosphate (^dytyttransf^ase 1; chofin 233 

Hs.191783 ESTs 233 
HsJ22394 ESTs; W^Wy similar to weak similarity 

pfctehUS)1[Cxtegans] 232 

Hs.159456 DKFZP566F123 protein 232 

zn52d12^Stratags»rnusdB 937209 Hsa 232 

Hs.141742 ESTs 232 

Hs.187991 DKFZP564A122prcte*i 231 

Hs,40515 ESTs; Weatdysbnflartolin ALU SUBFAM1 231 

1&237955 RAB7; member RAS oncogene family 231 

Hs.132784 ESTs; Weakly similar to cDNA EST EMBLT0 2.91 
Hs.6679 HsapjensmR^cDfttDKFZp^A0424(fiom 

Hs.107213 ESTs^H^siiTiilartoNY-REhWan^Bn 23 

Hs^50824 ESTs " 23 

Hs,41270 procofiagen-tysine; 2-axogtutarate 5-db 23 

ESTs 23 

H<k24631 ESTs 29 
Hs.75860 hydroxya^l-Coenzyme A dehydrogenase/3-k 



AAQ25557 
D59570 
AA255546 
AA393812 
AA350690 
AA504773 
F01449 
AA446946 
U28369 
AA278907 
M90857 
AA053401 
H23543 
R91241 
AA805726 
N73762 
AA423968 
AA504343 
A13012D1 
W15580 
AA588536 
F04465 

AA083812 
AA086320 
W84768 
W27301 
AA018587 
AA284097 
R44357 
F10210 
AA481403 
AA459961 
U84573 
AA905327 
AA034947 
H27267 



16696 F09780 
135204 AA421148 
134946 AA406534 
114975 AA250850 
113792 W35212 
102322 U34962 
125642 A1096849 
100288 D43951 
105878 AA400184 
I25262 W88755 
114419 AA011448 
1 30639 D59711 
13(872 AA370302 
126906 H66949 
121807 AA424507 



-Coenzyme A hydratase (trifundbnal pro 
EST 



239 
239 
2.89 
238 
238 
238 
238 
238 



105474 
122348 



135143 
106711 



132139 
114484 



1 00403 
1 3795 



IQ2662 
J32232 
106111 



I22783 
112788 



AA255440 

AA443695 

AA521186 

AA102644 

AA464741 

132832 

AA213410 

AA03437B 

N74051 

D85527 

AA448627 

T70214 

U70321 

AA252030 

AA421638 

C13961 

AA459895 



Hs.66124 

Hs.183418 cefldMstencycte2-BBl{PimREpr 
Hs.193053 ESTs; Weakly sfrnter to Kwi [H.sapfens] 
Hs.13944 aorenerojc; bet* receptor kinase 2 
Hs.17691 ESTs; Weakly state to env protein pis 
Hs.54473 cartfiafrspecffichameoboot 
Hs£5274 ESTs; Moderately sim&rto putative sev - 
Hs.153834 Human mRM far K1AA0O99 gene; complete cZ88 
Ha24656 KIAA0907 protein 2.88 
Hs.108514 ESTs; Highly similar to Trfo [ftsaptens] 238 
Hs.106532 ESTs; Weakty sim3ar to transposon LRE2 2.88 
Hs.17132 ESTs 237 
H&21739 H sapiens mRNA; cDNA DKFZp586l1518 (from 
Hs.168089 ESTs; Highly simiar to CALCIUMS WDING 237 
H&247478 H sapiens Mut S homotog 5 gene; partial 
1C7; LST-1; iymphotoxfa beta; tumor necr 
Hs219614 Wwx protein FBL11 
Ha231476 ESTs 
Hs34217 ESTs 



KIAA10S6 protein 

Hs.143187 Human DNA from 

Hs.101842 AT-WrKflngtranscr^«^1 
Hs.111554 ADP^ibosytaiirjnfactDrfi®7 
H&252351 He^V-HLTR-assodafing2 
Hs/194092 ESTs; Weakly simiar to III! ALU SUBFAM1 
H sapiens mRNA for UM domain, partial c 
Hs.125163 ESTs; WeaHy siite to LEQ AUJ SUBFAMI 
Ks.183548 ESTs 

Hs.130227 tmiorriecn^sis factor receptor sui)e^ 

H3.42640 ESTs 

Hs3451 ESTs 

Hs^10115 EST 

Hs38988 ESTs 

Ks.163630 ESTs 



PC3_cefls, Caafl, HbrobJasts2 
ULSCJB45, BT474_ceIfe, Lu.SCJi69 
OVCARjceDs, SaTjoells, ffl.cefc 
PRSCjrai EBjeQs, DUW^ceSs 
UlSQJ&SK MB7, IMJDJ58 
EB_ceOs, A549 jeb. HS578T_ceQs 
Ln_SCJi345, PC^cete, OVCAR_ceHs 
EBjceDs. U_SCjm fibroblasts 2 
LNCaP.cefa, BLceOs, 293Tj*Bs 
PRSCjxm, PRSCJog, PRSCJog 
Lo.SC J045, HT29_cefc. BT474_cefc 
EB_cefls, UiJDJQZ, UlADL.358 
EELoeDs, MB23LceHs. OVCATLceOs 
EB_cete, DU145_ceDs, 293TjceQs 
A549_ceb, PC3jcefts, DU145_cefe 
293T_cefis, LujXJW60, PC3_ce& 
PRSCJog, ULSCJ045, MB231_cefc 
ULSCJ045, lHJSCJ«9 ( PRSCJog 
H3578T_ceQs, 293T_ceBs, 293T_ceQs 
EB.cefe, MB-MDA-453, Ftoobhsts2 
HT29j3efis» CALU6jcefc. HMEC 
DU145joefls, UcSCJ^ Caoo2 
EB^cefe, Ui^SCLHSZO, PRSCjcon 
EBLcefis, UlAD JC3. PRSCJog 
EBjrfs, HS578T_cete, LuJUD^358 

PC3_ce&, EB.ceBs, OVCAR.ceDs 
A549_cefe, HS57BT_cefe, PRSC.con - 
U_SC Ji69, U|JSCJj345 v Q jsOs 
DU145_ceis, RbrobJaste^ MCF7 
OVCAR_ceb, DU145_ceBs, CALU6_cefe 
OVCAR^ceb, EBLcefis, PC3_cefls 
293TjceDs, OVCA^cefls, PC3_cefls 
DU145.03QS, DU145__cefls, CALU6_ce0s 
231 EBjseb. Lu_SCJ^, 293L.ceDs 
CALUe.ceOs, EB.ceSs, OVCAFLcefc 
EB_ceUs, Caco2, mUDM35s 
DU145joefis, HS57aT_ceHs, A549_cefc 
MCF7, HMEC (total RNA), 293T.celb 
EBLcefis, Lu LtC JW60, 293T.ceDs 

LNCaPjceBs, DU145_ceils, OVCAR_cefe 
CALU6_ceb > 293T.ce8s, 293T_ce& 
PC3_cefls, EB.ceDs, LNCaP.ceDs 
EB_ceOs, LNCaP_ceQs, Caco2 
EBjcefls, ffi eels, EBjceDs 
MB-MDA435S, U.SC JB9 f CALU6_cete 
293T.csSs, HT29_ceBs, U_AD Jt23 
PC3_cefls, CALU6_cefls, 293T_cefis 
293T_ceQs l LNCaP_cefls, EB_cefis 
OVCAR_cels, DU145,ceHs, 293T^cefe 
DU145_cefls, HS578T_ceto, lvB231_cefe 
EB.ceOs, UlADJ423, Fibroblasts 2 
EB.cefis. A549_ce&, OVCAr\.oefe 
237 293T_ceBs, A549_ce3s i LuJjCJMSD 
UuSC J645, UlSC Jffi9. LNCaP_ce& 



237 


U_SC_H69, HT29 jslls, RPWE_2 


237 


UlAD J123, Caco2, EB_cefe 


237 


HT29 UlSC J«9. BT474_ce0s 


Z86 


MB-MDA453, OVCAR_ce&, Lu_SC_H59 


236 


PC3_cefis, EB_ceQs, 293Tjceb 


co 236 


EBjceOs, luJDJUZ Lu J.CJW60 


235 


LNCaP.cefls, Caco2. S_ce& 


235 


A549^ceSs, HS578T.cefis t Caco2 


235 


PC3_cefls, UuSQJiSa), MB23Lce& 


235 


ULSCJ1345, MB231_cefls, Fibroblasts 2 


234 


ULADJ358, Lu_AD_^, Iiffi231_ce!ls 


234 


LLSCJ045. OVCARjceOs. PC3_ce& 


234 


DU145_cans, DU145_cefe, OVCARjceOs 


234 


EBjceDs, UlADJ% Rbrobbsts2 


234 


EB^cefla, OVCAR^ceBs, U_SC_H345 


233 


EB_ceSs. Lu JjC JH460, OVCAR^cefe 


233 


DU145,cefis, LNCaP_C8lis, Lu_SCJi345 


233 


EB.cefls, MCF7, LlSCJ«9 


Z82 


DU145_cefe, Lu.«JJC45, EBjceDs 
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120823 AA347546 Hs.185780 ESTs Z82 

100378 DM09 Hs.10848 WAAM87 gene product 232 

114677 AA114163 Hs.1 88377 ESTs 231 

108085 AA045602 Hs.62863 ESTs; ModerateV sfrrto to serfne^reo 231 

104938 AA084627 Hs.18341 ESTs; Highly similar to CGl-72 protein [ 231 

128743 AA237013 Hs7730 heterogeneous nuclear ribonucleoprotein 23 

124314 H94877 Hs^15766 GTP-ciraDng protein 23 

1342Z7 D79S86 Hs.80338 WAA0164 gene product 23 
122922 AA476268 zw44h1.s1 Scares JoteUetusJ4a2HF8_9w H 

contains Atu repetitive etemenfcccntaln 2.79 

120)96 H42968 Hs.155606 paired rnesedennhomeo box 1 2.78 

1292S5 AA424782 Hs.1 10121 SEC7homotog 2.78 

116155 M460957 Hs.76053 DEAD/H (Asp-GIu-Ala-Asp/His) box poJypep 2.78 

105911 AA401809 Hs.189910 ESTs 2.77 

119232 T03475 Ha258624 EST 2.77 

131168 AA482C07 H&23788 ESTs; WeaWy slmfcr to homology with Is 2.77 

106048 AA416697 Hs.15330 ESTs 2.76 

124352 N21626 Hs.1 02406 ESTs 2.76 

129349 D86974 Hs.1 10613 KIAA0220 potefi 276 

106120 AA423808 Hs3765 RNA heficase-reteted protein 276 

100643 HG2755-H T-PtasOn 275 

128500 U60521 Hs.100641 caspase 9; apoptosis^elated cysteine pr 2.75 

126090 R44789 Hs.1 19485 ESTs; Weakly simter to rostral cerebell 275 

127064 Z43709 HSC1JA091 normalized Infant brain cDNA H 2.75 

132989 AA48Q074 Hs394 adrenorneduSn 2.75 

108888 AA1356Q6 Hs.1 89384 ESTs; WeaWy simBar to llll ALU SUBFAMI 2.75 

119579 W42429 Hs.150507 ESTs 274 

100387 D83777 Hs.75137 KJAA0193 gene product 274 

114744 AA135407 Hs252351 HERV-H LTR-assorialing 2 274 

129092 AA011243 Hs.63525 pory(rC)-blnding protein 2 274 

125360 AA677978 Hs.1 89741 ESTs 274 

107874 AAQ253Q5 Hs^5218 ESTs; lAfeaJdjr stmOarto reverse transcri 274 
114086 Z38266 Ks.12770 H sapiens PAC done OJ0777O23 from 7p14- 274 

116180 AA463902 H&34964 ESTs 273 

126027 M61982 ESTs 273 

116339 AA496257 Hs.72165 ESTs; WeaWy sinter to R26984J [Ksapi 273 

105387 AA236951 Hs.1 08836 chromosome 1 open reaifing frame 9 272 

111359 N91273 Hs37179 ESTs 272 

106680 AA461458 HsJ247B9 ESTs 272 

118598 N69136 H&214343 ESTs 272 

107913 AA027161 Hs.59523 ESTs; Highly similar to G1 TO S PHASE TR 271 

134315 AA136269 Hs31648 ESTs; WeaWy slm3ar to S164 [Haptens] 271- 

135233 AA127463 Hs3683 proteWdnase; irterferoiHndudbte dou 271 

112932 T15470 Hs.189810 ESTs 27 
119053 R11501 yf28f1^1 Scares fetaJ fiver spteen 1NR 

contains AJu repetitive element, mRNA 27 

131206 AA044O78 Hs24210 ESTs 27 

126759 AA063642 ESTs; Highly similar to (define not ava 27 

131060 AA160890 Hs32564 myosin VI 27 

132135 N69101 Hs.40730 ESTs 27 

120835 AA348446 Ha.96906 ESTs 27 

113815 W45311 Hs.14756 ESTs 27 

133234 T90092 Hs.6853 ESTs; WeaWy similar to !lil ALU SUBFAMI 269 

126819 AA305536 Hs.161489 ESTs 269 

125198 W69474 Hs^25550 ESTs 269 
108394 AA075144 zm86f&s1 Stratagene ovarian cancer (#93 

gfcX1664 TRANSLATIONAILY CONTROLLED TUM 

134456 X58405 Hs.83532 membrane cofactor protein (CD46; trophob 269 

111720 R23739 H&23585 WAA1078 protein 268 

114617 AA084148 Hs.110659 ESTs 268 

127787 AA731764 ESTs; WeaWy simter to till ALU CLASS C 268 

101437 M20681 Hs.7584 sohrte carrier famDy 2 (fadHatedgki 268 

133761 AA477223 Hs.75922 brain protein 13 " 268 

105869 AA399574 Hs.19086 ESTs 268 

125191 W67257 Hs.138871 ESTs; WeaWy simter to D1I ALU CLASS B 268 

116238 AA479362 Hs.47144 DKFZP586N0819 protein 267 

124770 R40555 Hs.120429 ESTs 267 
101764 M80563 Hs.81256 S100 caJdurrvtindlng protein A4 (cakium 

murine placenta! homo tog) 267 

130897 AA063428 Hs^1022 adaptor-retted protein cornptex 3; beta 267 

133303 H61046 Hs.237352 EST 266 
124724 R12405 Hs, 112423 H sapiens mRNA; cDNA DKFZp586J1420 (From 

123697 AA609601 Hs^21224 ESTs 236 

111548 R09170 H&258707 ESTs 266 

107005 AA598679 Hs. 19421 5 ESTs 266 

105569 AA278399 Hs3Q596 ESTs 265 



HT29_cefe. HMEC (ferial RNA), BT474_ceto 
Cacc^ PC3_ce2s. OVCARjceBs 
DU145_cefis, MCF7, EBLceQs 
EBjsSs, hiJiDj&X HT29Lceb 
PC3_ce8s, HS578Lcete. OVCAILcete 
OVCARjsOs, UlCaPjsSs. Caco2 
LNCaP.ceQs, DU145JS&, Caco2 
LNCaPjsils, A549 jefe. BjaQs 

LuJSC JC45, OVCAR_cefe, Lu_SCJ*69 
LuJVD_H23, Lu_SCJi69, UlULH460 
LilAD_H23, EBjsSs, LiL$CJi345 
EBjceSs, OVCAILcefe, 293T_ceHs 
293T_cefc, HS578T_ceDs. DU145_ceSs 
EB.cefls, UUD JW. liLAD J58 
EB_ce8s, LhJjCJ4460, MCF7 
OVCAR^ceDs, U.SCJB45, 293T_ce0s 
MCF7, M&MDA453, CALU6j*Hs 
DU145__cetts, HT29_cete. Lu_SC_H69 
OVCARjxDs, EBjxDs, 293Tjce0s 
293Tj*te, PC3_cens, HS578T_cefc 
UlAD.358, LuJSC_H69, Lu_SC_H345 
Lu_SC _H69, U_SCjm5. BT474_cefc 
CacoZ A549_ceBs, HT29cefe 
EBjsDs, OVCARjsfe, DU145_ce0s 
OVCAR_ceb, LNCaPjceHs, DU145_C8te 
293T_ce&, Dtr145.ce0s. PC3j»Bs 
CAUiejaBs, OU145_cefe, Caco2 - 
PCSjcefls, Lil_SQ_H520. RPW^_2 
EBjce&s, MCF7, DU145_cefls 
LuJU3_358, UlAD_358 p PRSCJog 
Lu_SC_H345, LuJ.C JM60. HT29_cete 
EBjceDs, IWCaPjceBs, BT474_cefts 
Lo_XJi69. PRSC.con, UiJ\DJ423 
UCaPjsBs, DU145_cete, A549_cefe 
EB^cefe, DU145_cefe, OVGAR_ce!!s 
PC3^ceDs, EBjceQs, Caco2 
EB^cefls, LNCaP.cefls, 293T„ceDs 
PC3_ceHs, LnjSCJ<345 t Caco2 
U&mMSa, 2937.0303, HT474_ceQ3 
EB_cete, MCF7, UlSC_H345 
EBjsSs, 00145.06113, HtCC 
EB.ceQs, OVCAR^cete, Caco2 
293T_cete, UiJDjVi, PC3.celIs 

U.SCJC45, LiLSCJi69, DU145_cete 
Caco2, Lu_SCJ^345, HS578T_ceBs 
LNCaP.cens, LlSC J045, UlSC_H69 
LNCaPjceBs, MCF7, HT29jk& 
EBLcete, 293T_cefe, O VCAR_cefis 
Fibroblasts^ CALU6_cefe, RPWE_2 
EBjceBs, PC3_ceUs, DU145_cets 
U_SC_H345 f OVCAR^cefls, DU145_cete 
EB_ceto, DU145_ceQs. Caco2 
Lu_SCJH345, LLADJC3, LuJ\D_H23 

269 HMEC, HMEC (totaJ RNA), Fibroblasts 2 
EB_ceite, LNCaP.cefe, DU145_cefls 
PC3_cefe. HMEC (total RNA), OVCARjsSs 
DU145_cefls, LNCaP^caDs, OVCARjsfe 
HT29_ceIs, LilSCJI345, MB231_ceDs 
03002, LLLCJ4460, FibroWasts2 
EB.oeQs, LiLAD LH23, U.SCJC45 
PC3^cete, MCF7, MB231^ceQs 
OVCAR_ceas, DU145_ceis, LNCaPjceOs 
OVCARjceHs, DU145jcete, LNCaPjceOs 
ULADJC3, U_SC_H69. PRSC_Con 

A549_ceBs, MB231_ceUs, OVCAFLceNs 
EB^cefis, LILADJH23. HMEC 
ULSCJ045, Lu_SCjm PRSC.con 
266 Lu.SCjm5, BT474_ceO S| OVCAfLcelis 
OVCARjcalis, 293Tjce«s, La_SCJ«9 
»3T.ceD3, CALUS.ceHs, A549 -C eDs 
Lu.SCJi345, OVCAR^ceOs, UlAD_K23 
MCF7, HT29_ceHs, BT474.cefls 
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104105 
121335 
124353 
124253 

12am 

129535 
131397 
130175 
127507 
105377 
114571 
133726 
132380 

127986 
116208 
1X946 



KS9742 

AA481549 

A A608S5 2 

AB0Q2338 

X7K93 

AM 88445 

AA238702 

AA1 12679 

W19933 

H68018 

AB70418 

AA476333 

AA089458 

AA463234 

M31606 

AA032084 

R37495 

AAQ29973 

AA600357 

AA056181 

D52154 

H80181 

AA504110 

AA133527 

N24020 

AA424813 

AA055560 



ABC02301 Hs54985 KIAAQ3TC protein 
AA422123 Ks.42457 ESTs 
AA404418 Hs.144953 ESTs 

Hs.172330 ESTs; Weatfy to WiskoMtti^ 
Hs.102201 ESTs 
Hs.165694 ESTs 
Hs.1 12603 EST 

Ha 26395 erythrocyte membrane pjoteiu band 4.1-0 
Hs.151536 RAB13;rrenTberRAScnccgerBfem3y 
Hs.152618 ESTs 
H&24371 ESTs 

H&2S2291 ESTs; Weatfy sfm2ar to HO ALU SUBFAMI 
Hs.75761 SFRS protein kinase 1 

yr7Qi05j1 Scares fetal Over spteen 1NF 
IMAGE211257 ff, mRNA seq. 
Hs.192050 ESTs; WeaJdysaniartoDDAUJ CLASS A 
Hs.42532 ESTs 

H&21490 WAA0438 gene product 
Hs.119387 WAA0792 gene product 
Ha 196177 phosphaytasetinase;ga!nnH2(tes&s) 
Hs.124841 ESTs; Moderately similar to transfomi^ 
H&23578 ESTs 

Hs.1 07979 small membrane protein 1 
K&239489 T1A1 cytokBdcgranute^ssodated RNA-W 
Hs.17311 DKFZP434N161 protein 
Hs.172458 Huronate 2^ulfatase (Hunter syndrome) 
ESTs 

Hs.1 8063 ESTs 

Hs.126925 ESTs; WeaJdysntotoTheKlAA0138 gen 
Hs.132913 ESTs 
Hs2S692 ESTs 

Hs.13179 ESTs; Moderately simQar to QQ ALU SUB 
Hs.1 14062 ESTs; Weakly simBar to T15B7.2 (C^ega 
AA497013 Hs.188740 ESTs; Weakly am2arto!ItJ ALU SLBFAM1 
R07785 yf15c06n Scares fetaJ &ver spteen 1 NF 

contains Ahi repetitive etementcontah 
ESTs 

KIAA0077 protein 

ATP^wnding cassette; sub^amiy C (CFTR 
Fas-acfivated senne/threcnha kinase 
ESTs; WeaWy stmliar to C13F10.4 [C.eteg 
ESTs 

Hs.157082 H sapiens PAC done DJ0S88G15 from 7o33- 2.56 
Hs.1 77407 ESTs; Weakly similar to !fll ALU SUBFAMI 156 
yy05b07j1 Soares metenocyte 2M>HM H sap 156 
ESTs; Moderately sJmfer to histamine N- 156 
ESTs; Moderately simfar to rsj-1 prate! 
ESTs 

ADP-nbosyiafen factor-fte 5 
CD2-assotiated protein 
activating transcription factor 5 
ESTs 

heat shock 70kD protein 6 (HSP7QB 1 ) 
fight junction proteh 1 (zona occluders 
Hs.1 14034 matemalGIO transcript 
H&S8722 ESTs 

trariscnpflon etongatfon factor A (Sll); 
ESTs 
ESTs 
ESTs 

ribosornai proteh S7 
Hs.193977 ESTs 

Hs.83883 HurnanDfMseq^ctone7iaJ7OTcruonK) 

pru)sphcerx)^yruva^ 153 
D43682 Hs£2208 scyU^ers^kc^iog^^^^ 153 
U60975 HtrnianrrybrM receptor gp# 153 

D63478 Hs*127 WAA0144 gene product 153 
AA258063 H&23438 ESTs 253 
RQ2401 H&221078 ESTs 253 
AA398933 Hs.172613 solute carnerfemBy 12 (pcifassrumfchb 152 
AA116036 Hs,9329 dvomosome 20 open reaolng framel 152 
W39128 Hs^47901 Human DNA seq from cione8B1 on cfromoso 

-CELL MEMBRANE GLYCOPROTHN PC-1; me ge 
135334 AA053134 Hs241558 ariatme-2 (D. rnebnogaster) homoJog (all 152 
128538 R44214 Hs.101189 ESTs 252 
109865 HQ2566 Hs.191268 H sapiens mRNA; cDNA DKFZp434N174 (from 



101551 
114473 
111863 
129018 
107058 
126175 
131979 
126122 
106961 
114730 
117342 
131622 
104904 
117359 
123331 
125324 



129813 T33462 
100265 D38521 
134890 T40902 
133562 AA421874 
135011 H73161 
107226 D58185 
126042 H62441 
114472 AA028924 
126291 N42090 
113349 T79021 
105769 AA347485 
110918 N46423 
117170 H98153 
105159 AA173981 
105726 AA2S2328 
132079 H67964 
131813 X51757 
133538 L14837 
124981 T40849 
122028 AA431306 
122487 AA448332 
119315 T41152 
107957 AA031948 
122457 AA447780 
103572 Z25749 
124395 N29963 
116024 AA451748 



134361 



105519 
124684 
105852 
105012 
126534 



Hs.12600 
Hs.75935 
Hs.90786 
Hs.75087 
HsJ£991 
Hi21945 



Hs.14438 

H*25477 

H&24283 

Hs.42500 

Hs.30430 

Hs£754 

H&38694 

Hs3268 

Hs.74614 



Hs.80598 
Hs£0485 
H&57548 
Hs.96418 
HsJ5538 



165 
165 
165 
164 
164 
164 
164 
164 
164 
163 
163 
163 
163 

162 
162 
161 
16 
159 
159 
159 
159 
159 
158 
158 
158 
158 
158 
158 
158 
157 
157 
157 
157 

157 
157 
157 
157 
156 
156 
156 



156 
156 
156 
155 
155 
155 
155 
154 
154 
154 
154 
154 
154 
154 
154 
154 



HMEC {total RNA), HMEC. LNCaP.ceOs 
U.SCJC45. Lu_SC_H69, DW45_cefls 
QjceHs. Fftrottasts2, DW45_ceDs 
LlLSC J69, 293T_oe&. S_ceDs 
DU14SLCB&, OVCARjsBs, LnJSC_H345 
HLcefes, UlSC_J!69. Lu_SCJC45 
EB.ceBs, UlAQJEB, FIbrobfasts2 
EB.cefe, DU145_ce&, Caoo2 
FforoWastel PRSC.con, HS578T_ce8s 
ajsDs, Lu_ADJt23, UUCJW60 
Cacol HLcefls. CALU6 ceDs 
EBjceib, DU145_cefe, Caoo2 
EB_ceDs, LilAD JE3, U.SCJE9 

EB_ceOs, UlAD JC3, Lu_SCJffi9 
DU145LceQs, OVCARjceOs, LNCaP_ce2s 
DU145_cefls, PRSC_con, Ffcrdrfas1s2 
LNCaP_ceDs, DU145_ceDs, HS578T_ce8s 
EBjceOs, N&MDA453, Caco2 
L^CaP^ceSs, EB.ceSs. MB40V453 
DU145_ceDs, Caco2, OVCAR_ceas 
KT29_cefe. MB231_ce2s, Lu_SQ_H520 
A549_cefis, EB.ceSs, KS578LceQs 
DU145_ceIls, UJSCLH345, EBjceOs 
IolSC_H345, DU^.cefc, LNCaP_cefe 
DU145_ceDs, PC3_ceDs, A549_ceQs 
DU145_cefls. OVCAILcells, LNCaPjceQs 
HhCC, DU145jDefiS, DU145_ce& - 
DU145_cefls. LNCaP_ceDs, MCF7 - - 

HS578T^cefe. DU145,ceQa, WCaP.cefls 
PRSCjcon, PRSCJog, HS578T_cete 
Lu_.SC.H345, Lu.SC Jt69 f BT474.ce& 
HS578T_cefis, PRSCjcon, EB_ceOs 
Lu.SCJtt9, Caco2. PRSC_con 

EB^cens, Lu_AD_H23. Fibroblasts 2 
Lu_SCJH345, 293T_ceHs, Lu_SC_H69 
EB.cete, LNCaP^ceDs, PC3jDels 
A549^cefls, DUl45_ceHs, EB_ceOs 
EB_cefe, luJM _H23, UlAD_358 
EBjceOs, LNCaP^ceOs, MB-MDA453 
Lu_SC_H345, LiL.SCJi69, HKEC {total RNA) 
HMEC (total RNA), HMEC, RPWEJ2 
Lu_SC_H345, LIL.SCJK9, DU145_cefc 
HMEC. HMEC (total RNA), PC3_cefis 
KT29_ceto, PRSCJog, LIL.SCJ045 
Lu_ADJ^, RPW^, LujSQ_H520 
EB.ceOs, CAUJ6.cefls, DU145_ceDs 
OVCARjceDs, EB.cefe, LNCaP.celb 
IrtCaP.cefe, EBjceOs, DU145_ceTJ3 
MCF7, ffi.ceDs, M&M)A-453 
EB.ceSs, DU145jceDs, HS578T^ceto 
UUMJUX MB23Lcete f F2xofatests2 
DU145_ceDs, Caco2, A549jceQs 
EB.ceSs, Caco2, LNCaP cefe 
Fibrobbstsl BT474^ce8s! HMEC (total RNAJ 
LU.SCJH345, MCF7, MB^)A453 
Lu_SC_H345, MBWM35S, PRSCjcon 
A549.ceb, RPWEJ. DU145_ce& 
DU145_ceHs, EB.ceUs, A549jcels 
EBjcefis, CALU6Lce2s, DU145_ce0s 
HMEC (total RNA), HMEC, RPW^J 

LNCaP_cefc, RPWRJt, MBM)A453 
LNCaP.ceSs, CALU6_ce0s, DU145_ce& 
EBLceQs, HMEC (total RNA), Caco2 
BT474.ceDs» HT29_ce2s i LjlAD_358 
SLceHs, Caco2, M&M5A435S 
U_SCJ045 t OVCAR_ce«s, Lu_SC_H89 
IKCaPjceDs, DU145_ceBs. SjceDs 
CALU6jceHs, Caco2, DU145_cefe 

152 BT474j»&. LNCaPj»Ds, Lujtf) JVS 
293L.ce0s, CALUB^ceSs, DUl45_ce2s 
EBjcefls. ULADJB3, U.SCJi345 
152 DU145Lcefls, LNCaPjseBs, OVGARjafe 
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118579 
117590 
104340 
122455 



123258 
118487 
105044 
107480 
111760 
132474 
103423 
123488 
100475 
112003 
114315 
105291 
135354 
107521 
108373 
108836 
110386 
129658 
134283 
101844 
133461 
103545 
128440 
134992 
116295 
107004 
132137 
12639) 
113050 
101667 
108569 

117186 
129091 
128468 
117498 
105407 
128941 
116486 
134869 
130664 
107985 
110300 
113471 
131474 
120791 
133733 

119977 
134921 
132295 
133395 
106728 
116370 



N689Q5 

N34904 

F15201 

AA447744 

AA211901 

AA490929 

N66763 

M416546 

W58057 



N68018 

X97249 

AA599708 

D90276 

R42547 

Z41QZ7 

AA233311 

AA188934 

X78262 

AA074393 

AA132061 

K45516 

M22348 

H12661 

M93425 

M33318 

214000 
N76763 

H05625 

AA489016 

AA598675 

AA2B2312 

W28286 

T26366 

AA085398 



AA044622 

T236S 

N31726 

AA243478 

R55763 

C14128 

T35288 

R09049 



128862 
111614 
111993 
131554 
130983 
131654 
109)14 
106300 
102386 
112517 
125375 
123808 
114950 



H37620 

T87174 

U28749 

AA342802 

AA416973 

W88579 

W60186 

H66351 

AM91296 

AM65355 

AA521256 

W81552 

R81297 

R12581 

R42241 

AA100026 

N71215 

AA49705O 

AA121123 

AA435840 



Hs*9141 

Hs£6430 

Hs.105274 

Hs.43080 

Hs.149436 

Hs.74304 

Hs^21434 

Hs.180930 

Hs.123122 

Hs.187764 

Hs.12 

Hs.172551 

H&26297 

Hs28752 

Hs,99367 

Hs.61950 

H&222727 

Hs33268 

Hs.131255 

Hs.8107 

Hs.62 

Hs.183584 

HSJ5384 

HS32414 

H&91216 

Hs239475 

Hs.4076 

Hs.100090 

H&22711 

Hs.79110 



Hs.42612 

Hs.183755 

Hs258674 

Hs.44268 

Hs3206 

Hs.107287 

H&251980 

Hs.90421 

Hs.17625 

Hs.71968 

Hs.124147 

Hs.16341 

HS2726 

Hs.194031 

Hs.75798 

Hs.124744 

Hs.169487 

Hs.181042 

Hs.72805 

Hs.153768 

Hs336204 

Hs.83623 

Hs.106673 

Hs.191146 

Hs.106359 



H72971 
AA620552 
AA243503 
H39216 



103408 
129703 



AA401348 



HSJ21662 

HS3Q204 

Hs.191374 

Hs.19114 

K&81728 

H&23721 

K&25682 

Hs.11801 

HSJ239970 

Hs.198252 

Hs.179999 



smaJ Inducible cytoWne A5 (RANTES) 251 

ESTs; Moderate^ «artoDD ALU SUB 231 

ESTs 231 

EST 231 

ESTs 231 

ESTs 231 

ESTs 231 

fdrtfisfri family member 5B 231 

peripkkta 23 

ESTs 23 

TBP-assodated factor 172 23 

FSH primary response (LRPR1; raf) homoto 23 

ESTs; WeaWy similar to ill! ALU SUBFAMI 2.49 

cajctooembryonic antigen-related ceD ad 2.49 

ESTs 149 

ESTs 2.49 

ESTs Z49 

ESTs 2.49 

HsapIsnsmRNAforTRB 2.49 

ESTs; Weakly stm8ar to nuctear protefn 2.49 

ESTs; WeaWy simSar to ubiquitous TPR m 2.48 

ESTs 2.48 

ubquinok^tochrome c reductase binding 2.48 
H sapiens mRNA; cONA DKFZp586B091B (from 

protein tyrosine phosphatase; non-recept 2.48 

cytochrome P450; subfamfly UA (phenobar 248 

ring finger protein 1 2.47 

ESTs 247 

ESTs 2.47 

ESTs; Highly ssmfer to partial CDS; hum 2.47 

ESTs 247 

CTD(cait»xy^emTiriaiacma^RNApo^ 2.46 

li Imnn mi 1 O AC 

WaaSpSn J Z,4o 

EST; WeaWy ssnflar to 60S RIBOSOMAL PRO 

nudeoSn 2.46 
zn7e3.s1 StratagenehNT neuron $937233) 

IMAGE346748 J, mRNA seq 2.45 

ESTs 2.45 
Human Chromosome 16 BAC dona CfT987SK-A 

EST 245 

ESTs; Highly simiar to myelin gene expr 2.45 

ESTs 245 

ESTs 244 

EST 244 

ESTs; Moderately stotoflfl ALU SUB 244 

ESTs 244 
H sapiens mRNA; cDNA DKFZp564H>53 (from 

ESTs 244 

ESTs; Moderate^ simCarto till ALU SUB 244 

high-mobSty group (rwnhistone chromoso 244 

ESTs 244 
Human DNA seq fiom done 1183121 on chro 

to predicted Gy and worm protefos. Con 243 

ESTs 243 

Kreister (mouse) rna^^eJatedteitaie zip 243 

Dmx-te1 243 

ESTs 243 

U3snoRNP-assodated 55^ protein 243 
ESTs; Moderately samBar to NUCLEAR PORE243 

nuctear receptor subfamSy 1; group I; m 243 

eukaryotte translation initiation factor 243 

ESTs 243 

ESTs ; 243 

ESTs; WeaWy similar to PROTEIN-TYRQSINE 

NOC^ssodatedprote&i1 242 

ESTs 242 

ESTs 242 

h^THnobity group (nonhistone chromoso 242 

unci 19 (C.efegans) hcmolog 242 

ESTs 242 

WAAQ277 gene product 242 
ESTs; WeaWy stmfiar to PHOSPHATIDYLETHA 

adenosine A2b receptor pseudogene 242 

ESTs; Weakly simflar to ZNF91L Jlsapfen 241 

G protem-coupted receptor 9 241 

ESTs 241 



Lu_SC_H345, LNCaP.celb, Lu_SCJ«9 
Lu_SC_H345, DU145jce0s, Lu_SC_H69 
UlSC JB45, PRSC.ccn, PRSCJog 
Caco2, LUJSCLH69, 293T_pefe 
EBjcate, DU145_ce0s, CALU6jsSs 
EBjcete, LtLAD_H23, USC_H69 
CALU6_cete. HS578T_cefe, OVCARjsfe 
EB cells, Caco2, DU145 eels 
Caco2, OVCARjefls, HMEC (total RNA) 
LtLADJi23, EB.cefe, UlADJ58 
LNCaP_ceto, EBjoefe, DU145jceb 
HS578T_ceDs. U_SC_H345, PC3_ce!ls 
OVCAFLceBs, LuJSC_Ji345, DU145_cete 
MB-MDA453, 293TjceGs, CALU6_cete 
EB_ce&, LilAD_H23, LlSCLH345 
Lu_SC_H69. OVCAFLcefta, UlAD_H23 
EB_ceb, CALU6_cefc, DU145_cefls 
MB-MDA-453, LuJSC J*9, 293T^cefls 
UlSC -H345, UlSC_H69. PRSC.con 
MCF7, M&MDA453, Lq_SC_H345 
DU145_ceHs. UUSCJ045. Lu_SC_H345 
PC3_ceQs, OVGARjoeSs, Lu_SQ_H520 ... 
LNCaPjoeDs, CALU6_ceDs, PC2Lce0s 
248 HMEC (total RNA), HS578T^ceGs, HMEC 
DU145_cete. EB_peSs, CALU6js0s 
EBjceBs, iuJDjm, LilAD_358 
HT29_cefe, LilSQJ^. BT474jce0s 
EBLcefc. UlAD_H23, LuJuO_3K - 
Lu„SC_H345 f GAUUILceils, LujSC_H69 
Pv©-MDA453, 293T_CeIb, MB-MDA-435s 
LUCaP_cefe, Caco2, OVCAfLcete 
Lu_SCJi69, rWEC. EBLcete 
EB^cefe, DU145^ceb, LNCaP.ceHs 
246 UUJCLH460, EBjcells, LuJVDJ58 
PC3_ce0s, 293T_ceDs ) A549_cefe 

Hr29_ceBs. BT474„ce0s, Lu„SQJ<520 
EB.cete, LuJUDJi23, LuJU)„358 
245 EB_celb i LulAD_H23. UlAD_H23 
LxlADJ423, ffi.cells, LllSC_H89 
U_SCJi69. DU145_cefc, OVCAR^ceDs 
EB cefis, 293Tj»Us, PC3.ceBs . 
EB.cefts, LNCaP.ceBs. A549„ceBs 
MB4UDA435S, HS578T_cete, 293T_ceHs 
EB_cefls. hUtojm, LllAD_358 
PC3 joeSs. EB_cels, A549.cels 
244 PRSCcon, PRSCJog, Caco2 
MB-MDM53, Caco2. OVCARjsfe 
08002, OVCAFLceCs, LNCaP_cete 
CALUS^ceSs, OVCAR-cete. 293LceIl3 
Lu_AD_H23 f Lu_SCLH520 t PRSCjan 

EBjcelb, Caco2, DU145_cefe 
HT29_cens, HMEC (total RNA), HMEC 
LNCaP.oefls, HS578T_cefls, M&WDA^53 _r 
Ul_SCJ«9, BT474JSH3, LU_SQJi520 
EB_ceDs 1 LNCaP.ceQs, OVCAfLcefls 
EBjcells, Lu_AD_H23, PC3_ceDs 
EBLcells, A549.ce0s. 293T_cefc 
293T_ceDs, OVCAR_ce0s, Flora Waste 2 
EB calls. DU145jceHs. DU145_ceQs 
HMEC (total RNAX FIbroWasb 2, MB-MDAr435s 
A549_ce&, DU145„ceIIs, CALU6_ceHs 
243 EB_ceH3, LNCaP_C8Ss, Caco2 
EBjcells, Caco2 A549_cells 
MCF7, MBM1M35S, Lu_SCJi345 
EBjcells, LuJVD_H23, LuJ-CJ1460 
EBjcefe, Lu_SC_H345 f A549jdbDs 
OVCAfLceBs, EB^ceOs, DU145jDefe ' 
Caco2. MCF7, DU145_cels 
U.SC J1345, OVCAR_cefe. LuJSC J«9 
242 EB.cefls, UlAD_H23, UlSC_H69 
MB-MDA453, HT29^ceHs, UU£ JM60 
ULSCJW5, Rbrotots2 DU145_cefe 
RPW5J, PRSCJog, Lu_.SC JW5 
EBLcete, 293T_ceSs t DU145_ceHs 
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105633 AA287104 Ks.161388 U5 snRNP^pecffic protein (220 kDfccrft 2.41 

1CS532 AA453628 HsJ57443 ESTs 2.41 

132132 AA010933 Hs.4055 core promoter etenrarf bfcidtog protein 241 

111409 R00311 Hs.18798 EST; Weakly similar to Ml ALU SUBFAMU 241 

133813 M2K57 H&250711 dipepferyl cajtcxypepSoase 1 (angtofens 241 

127240 AA888387 H&243845 ESTs; Moderately similar to UlI ALU SUB 241 

104975 AA08HJ71 Hs^0758 chronxsonie^sscdaledpdypepSrfBC 2.41 

118078 N54321 Hs.47790 EST 241 

115840 AA43253 H&58103 Aktnase(PRKA)aiKtorpmtBin9 241 

101186 1208 H3.179881 oo^rcfmg factor, beta submit 2L4 

113088 T40936 H&8349 ESTs 2.4 

115185 AA259140 H&60238 ESTs 24 

113778 W15263 H&5422 ESTs Z4 

128261 AI061213 Hs.13179 ESTs; Moderately simiar to Ifll ALU SUB 2.4 

132210 AA235013 Hs.42322 A krase{PRKA) anchor protein 2 14 
112561 R72427 Hs.129873 ESTs;W^sa!*artoCYTO(>{ROMEP450 

127588 AA510677 Hs.1 68851 ESTs 2.4 

106664 AA460969 Ks.7510 mtogen^fivated protein kinase kinase 24 

131367 AA456687 1&26057 ESTs 24 

103163 X67683 H^apfensmRNA for keratin 4 239 

109639 F04444 Hs.6217 ESTs; Weakly simiar to IG1 ALU SUBFAM1 239 

112007 R42671 Hs.140853 EST; WeaWy similar to Ml ALU SUBFAMIL 239 

100023 AFFXcontratEoM 239 

119923 W86214 Hs.184642 ESTs 239 

127705 AJ003307 AJ0O33O7 Selected chr 21 cDNA Ibraiy H 239 

130362 AA182658 Hs.179817 DKFZP586FQ222 protein 239 

100168 D14874 H&394 adrerameduffli 239 
134261 AA227678 H&8084 Human DNA seq from done 465N24 on chr 1 

Contains two novel genes; ESTs; GSSs an 239 

103392X9493 Ksapfens dbi/acbp gene axon 1 &2 238 

129888 U81001 Hs.131891 Human SNRPN mRNA; J UTR; partial seq 238 

130119 T12649 Hs^51653 tubulin; beta; 2 238 

118136 N57710 Hs.233952 proteasome (jxosorne; macropain) subunli; 238 

131163 H80107 Hi23754 ESTs 238 

115964 AA446622 Hs.74313 ESTs Z38 

135026 H59730 Hs.93231 ESTs 237 

133300 D51401 Hs.70333 ESTs 237 

129948 H69281 Hs.13643 ESTs 237 

112505 R67923 Hs33368 ESTs 237 

130715 T98227 Hs.171952 ocdudin 237 

120301 AA192163 Hs.1 04085 EST 237 

128062 AA379500 Hs.193155 ESTs 237 

127154 AA789101 Hs.198860 ESTs; WeaMy simDar to Qn ALU SUBFAMJ 237 

102814 U90716 Hs.79187 coxsacfde vhs and adenovirus receptor 237 

120239 Z41691 Hs.65919 ESTs 237 

105829 AA481883 Hs31236 ESTs; WeaHy similar to Unknown [Rsapie 237 

132681 AA435762 H&54894 ESTs; Highly similar to unknown [Ksapie 237 

108845 AA132946 Hs.68864 ESTs 2% 

133226 T85327 Hs.169552 ESTs 236 

106788 AA478728 H&26373 ESTs; Moderately similar to 110 ALU SUB 238 

119236 T10166 H&237297 ESTs 236 

106619 AA459255 1&23956 ESTs 238 

109178 AA181600 Hs.62741 ESTs 236 

112724 R91753 Hs.17757 ESTs 236 

112655 R85Q69 Hs. 141 139 ESTs 236 

132820 AA454988 H&57621 ESTs 236 

106155 AA425309 H&33287 nuclear factor VB 236 

114632 AA084742 Hs.194380 ESTs; Weakly simiar to IS! ALU SU8FAM1 235 

134776 J05582 HsJ9603 mucin 1; transmembrane 235 

101192 L20859 Hs.78452 solute earner famity 20 {phosphate ten 235 

130349 W16686 Hs. 171825 basic hefo-kx>r>hefo ctomain containing 235 

106389 AA446949 Hs.6236 ESTs 235 

109637 P04426 HsJ23131 Idnesinfenty member C3 235 

101483 M24486 Hs.76768 procoBagervproline; ZoxogSutarate 4-di 235 

131751 H18335 Hs31562 ESTs 235 

131050 X13367 H s.2250 leukemia inhibitory factor (cholinergic 235 

130097 N21159 Hs. 14845 torkfiead box 03A " 234 

134533 AA013468 Ks^41493 natural WDer-tumor recognition seq 234 

134839 D63479 Hs.1 15907 daq^AoeiDl Unass: della (130kD) 234 

115690 AA410894 Ks.44159 ESTs 234 

129079 N91011 Hs.108502 ESTs 234 

123517 AA608525 Hsw243059 EST 234 

126239 AA527215 Hs.75879 rftoscrnal protetn L19 234 

124440 N46435 ESTs 234 

111468 R05809 rfe^05481 ESTs 234 

12958) H18428 Hs.1 13613 ESTs; MocteratetystmlartDlID ALU SUB 234 



293T_eefls, CALU6_ce&, A549_ceDs 
ffi__ceJs» OVCAR_ce2s, Caco2 
HMEC, HMEC (tote) RNA), EB_cete 
LM.SCJC45, Ut.SC J169, PRSCjxn 
KT2SLce&3 l BT474jceQs, r\ffl23LceBs 
Lu_SCJB45, DU145_cefls, LNCaR_ceDs 
OVCAfLceBs, DtM45_cefis, PC3_cefe 
EB_ce&, FarobBsts2 HMEC (total RNA) 
OVCARjbSs, EBLcefe, PC3_ceDs 
Ea.ceOs, DU145 jcefls, CAUUfijaOs 
Caco2, HT29_cefls, EBjceBs 
Lu_SC EB cefls, Caco2 
Caco2 MB-MDA-435S, LNCaP.ceOs 
DU145_cek. WCaP_ce&, OVCAfLcefe 
CaccZ DU145_ce&, PRSCJog 
24 Lu_SQJ6a), L1LADJC3. EB.ceDs 
LNCaP.cefls, DU145_ceCs, OVCAR_cefls 
OVGARjsBs, 293Lce&s, A549_cefis 
EBjcefls, fcB-MDA-453, 293T_cefl3 
EB_ceas, LlLAD_H23, Lu JU)_358 
EB.cefls, Lu_SC_H345, UlSCJ169 
MB-MDA-435S, LujSC_H345, iMjD lH23 
Caco2, Lu JUL358, LNCaPjsb 
BB.ceDs, HS578L.cefe, DU145_ceUs 
LiLAPJi23, ULSCjmS, Lu_LC_H460 
EB_cefls, DU145_cefls, PC3_cefe 
RbroWasts2, Caco2, HS578T_cefls 

Pf^Cjcon. N&MDA453, LNCaP_cefis 
EBjcefls, LuJUDjH23, Lu_SCjH69 
LNCaP_ceUs, LilSCjH69. UUILH460 
UlAD_H23. Lu_LCjH460, LuJjCJWSO 
293T_cefis, OVCAfLceOs, HS578T_cefls 
liLADjH23, Lq_SCjH©, LilSC_H345 
EB_ceDs, LNCaRjceBs, DU145_ceUs 
EBLcefls, 293T_ceDs, Lu_a?jH69 
OVCARjsDs, Caco2, CAUU6.ceDs 
EB_cefls, LuJVD_H23. Lu_SCJG45 
DU145_ceBs. OVCAR^cefis, 293Lcefls 
Caco2 LNCaP^ceHs, DU145_ceb 
LicAP-H23, S^cefe, PRSCjwn 
EBjcefls. LNCaP_ce8s, DU145_cefis 
HS57aT_ceCs, MCF7, Lu_SC jH59 
OVCAR_cefis, DU145_ceBs, Lu_SCjH345 
EB.cefls, DU145_C8fls. LNCaP.cefls 
EBjcefls. DU145_ce0s, OVGARjs&s 
EBjcefls, LNCaPjcefls, PRSC_con 
LuJ^_H23, UlADL.358, Lu^SQjH520 
Caco2. M&WDA453, MCF7 
MB-MDA-453, Caco2, OVCAJLcefls 
EBjcels, 293T_cefl3, LNCaP_cells 
LNCaPjafe, A549_cefc, Caco2 
Lu_SC jH345, LNCaP.cens, S.cefls 
Caco2, EBjcefe, DU145.cefls 
Fibroblasts^ UlADjH23, LujLCjH4Q) 
SLceSs, OVCARjcefe, HS578Tjceb 
OVCARjceSs, Lu_SC_H345, N&MDA453 
U,SCjH345, LuJLC_H460, UjJVDJC3 
DUl45_ceas, UcADJCJ, UlAD_358 
PC3j»fls, CALUB^ceUs, M&WDA^35s 
A549_cefe, DU145_cefe, HTOjcefls 
IWCaP.cells, PC3_cefls, DU145.cefe 
MB-MDA-435S, A549_cefls, LuJJC_H460 
PC3_cens, HSSTeTjcefls, EBjcsSs 
DU145^cefls, MB231_ceas, HMEC 
UJ^AD_H23, PC3jceBs. PRSCJog 
EBjcefls, LNCaP.cefe, LNCaP.ceQs 
EBjcefls, HT29.ce&, HMEC 
UOCJW60, Caco2, DU145_ceQs 
PC3jcefls, EBjcefls, OVCARjcefe 
ULAD-H23, Lu_SCjH69 f UlAO.358 
LllSC_H345, PC3_cefls, MB-MDA-435s 
BT474_cefc, LujLCJi460. LujAD_K23 
Lu^SCjH59, HT29_cefl3, MB-MDA-435s 
luJDjm, PRSCJog, LO.SCLK520 
UlSCjH69, LilSCj4345, LNCaR_cefls 
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104857 AA043219 Hs.19058 ESTs 234 

109647 F04587 Hs3241 ESTs 234 

117160 H97817 Hs.183302 ESTs 234 

112352 R5B974 Ha 167343 ESTs 234 

113653 T95745 Hs. 197433 ESTs 234 

131606 W56804 H&29385 AFG3(ATPase family gene 3; yeaspce 234 

101525 M29536 Hs.1 21 63 eutaryote translation fnMon factor 234 

125921 AA775Q29 Hs.122591 ESTs 233 

125m AA213555 Hs.29205 atpha trtfegrin bindtog protem 63 233 

108743 AA126917 Hs.71074 ESTs 233 

133735 AC0Q2045 Hs251928 nudear pore complex interacting protein 233 

120403 AA234916 H&243851 ESTs 233 

134998 RQ2207 Hs32679 ESTs; WeaHy simBar to mfcrotubute-base 233 

108456 AA079326 Hs.143654 ESTs 233 

130552 M86667 Hs.179662 nucteosome assembly protein 1-fte 1 233 

111114 N63391 Hs.9238 ESTs 233 

127767 A1269498 Hs.125543 ESTs; Moderately simflar to TADA1 proteJ 233 
106546 AA454725 Hs21056 H sapiens mRNA from chromosome 5q21-22; 233 

122379 AA446110 Hs250989 EST 233 

133650 D84294 Ha.1 18174 tetraticopepftie repeal domain 3 233 

106434 AA4490JB Hs3151 ESTs; WeaWy simfer to atopy related au 233 

105297 AA233451 Hs. 183858 transcriptional intermediary factor 1 233 

115976 AA447442 Hs.86327 ESTs 233 

105788 AA351031 H&23965 sofeite carrier femSy 22 (organic anton 233 



113774 W04550 H&9927 H sapiens mRNA; cDNA DKFZp564D156 (from 
110617 H6S772 Hs35820 ESTs; WeaWy similar to D34I8.1 [Hsapie 232 
102234 U26312 Hs.8123 chromobox homobg 3 (Drasophita HP1 gamm232 



114777 AA151689 Hs.184519 ESTs; WeaWy similar to NO ALU SU BFAMI 232 

125518 R20148 Hs.193651 ESTs 232 

130814 AA256695 Hs.19813 ESTs 232 

123473 AA599143 ESTs; Moderately similar to nil ALU SUB 232 

134310 AA313414 H&8148 H sapiens done 24856 mRNA seq; complete 232 

119192 R85375 Ha237262 EST 232 

114391 AA004876 Hs.133100 ESTs 232 

119133 R49144 Hs.1 19756 ESTs 232 

109710 F09792 Hs.12929 ESTs 232 

116726 F13681 Hs.42309 ESTs 232 

133206 R32993 Hs.6762 ESTs; WeaWy simflar to simflar to leucy 231 

135163 AA125988 Hs.199955 ESTs 231 

111219 N68836 Hs.19247 ESTs 231 

110283 H29565 Hs.12271 ESTs 231 
103772 AA092473 H&8123 chromobox rtomotog 3 (Drosophia HP1 gamm231 

12Z766 AA459386 Hs.194058 ESTs; Weaidy simflar to atypical PKC spe 231 

120886 AA365566 Hs.132736 ESTs; WeaWy similar to allograft intern 231 

123512 AA6QQ248 Hs.142245 HERV-H LTR-assodafing 3 231 

106644 AA460239 Hs.12680 ESTs 231 

127359 H72971 KIAAQ277 gene product 231 

105919 AA402494 Hs3990 ESTs 231 

125241 W86291 Hs.121593 ESTs 23 

104624 AA001936 Hs.1 84721 ESTs 23 

128765 AA1017B7 Hs.1 0494 ESTs 23 
108360 AA071539 zm74b6.s1 Stratagene neuroepitheflum (#9 

HYDROXYSTEROID DEHYDROGENASE/DELTA-5-DEL 

115682 AA410300 Hs.44618 ESTs 23 

134528 M23161 Hs.84775 Human transposon-Sce element mRNA 23 
111091 N58858 Hs33032 H sapiens mRNA; cDNA DKFZp434N185 (from 

134044 AA262475 Hs.76746 phosphodiesterase 8A 229 

118229 N62339 Hs.180532 rieat shock 90kD protein 1;a^pha 229 

110168 H20522 K&20969 ESTs 229 

125073 T87185 Hs.1 93633 ESTs; WeaWy simflar to IIII ALU CLASS C 229 

111495 R07210 Hs.19913 ESTs 229 

124024 F03077 Hs.106672 ESTs 229 

128230 AA984074 Hs.1 75757 ESTs 229 

125471 AA477571 Hs.1 52601 UDP-glucose ceramMe ghjcosyttransferase 229 

120734 AA299949 EST 12545 Uterus tumor I H saptens cDNA 3 228 

134349 AA406373 Hs.8208 ESTs 228 . 

123412 AA521443 Hs.187763 ESTs 228 

116297 AA469042 Hs£9498 ESTs 228 

104476 N33607 H&223014 protease; serine; 15 228 

101004 J04101 H&248109 v-ets avian erythroblastosis virus E28 o 228 

109991 H09813 Hs.1 2896 WAA1034 protein 228 

118934 N92571 H&54808 ESTs 228 

125096 T94328 Hs.1 94533 ESTs 228 

117514 N32226 Hs.124058 ESTs 228 

132792 AA401903 H&242985 hemoglobin; garrana G 228 

129009 AA131421 Hs.107884 ESTs 228 



LiLAD_H23, U.SCJ045, Lu_SC_H345 
HS578TLceDs, A549jceBs. CALU6_ceQ3 
EBjsfe. Fibroblasts 2, LlSCJ{69 
ffi_cefls, U_SCJI345, HT29jcefe 
M&MDA-435S, MB-MDA-453, UlSC JC45 
OVCARjceBs, Rbroblaste2, MB-MDA435S 
EBLcefls, Caco2, DU145„ce& 
293T_ceHs f PRSCJog, Lq_SCJ045 
EB cefis, DU145_cefe, LNCaP_ceBs 
ixi/jyjEk Lu_AD_358, U1J.CJW6O 
LNCaP_cetts, Lu_SC_H69, DU145_ce0s 
MB231_ee&, Ui_SCjm5 t Lju_SCJS9 
LNCaP.ceBs, BT474_ceBs, MCF7 
KT29_cefls, LqJ\DJ^23, RPWEJ2 
EBjceBs, A549_cete, DU145jceIs 
Caco2. EBLceto. MB-MDA453 
CALUGcelts, 293Tjcefe, PC3_ce0a 
OVCAFLceHs, Caco2, LNCaP.cefc 
BT474jE3eUs, RbroWasts2, MBM)A435s 
Ln_SC_H345, EB_cefls, EBjjells 
EBjobUs, LNCaP_cefls, Caco2 
EB.ceRs, LNCaP.celb, Caco2 
EB_cete, 293T_cefc, U_SCJi69 
EBjceBs, UlAD _H23. UlSCJB45 
232 OVCARjceBs, ffijcefe, UJSC_H69 
ULSCJC45, IjlAP LH23, PRSCjxm 
CALU6_celts, LNCaPjceb, A549jcells 
HT29 cells, FrorobJasta2. UlSC_H345 - 
HT29 ceils, HMEC (total RNA), MB231_cete - 
R»WDA435s, Lm_SC_H69, PRSCJog 
LNCaPjeils, DU145^cefls, Lu_SC_H345 
PC3 .cells, LNCaP.ceto, OVCARjstts 
Lu_SCJi69, PRSCJog, PRSCjcon 
PC3_ce&s, 293T_cefa, 293T_cete 
PRSCJog, 293T_cete, 293T_cefis 
UiJDJW, ULSCJ169, Lu_SCJ1345 
MCF7, BT474_ceBs, MB^WDA453 
DU145_ceDs, 293T_ceUs, EB_ceDs 
ULSCJS345, LNCaP.cefis, DU145^ceBs 
OVCARjceBs. LNCaP_ceSs, 293T_ceBs 
BT474_cdls, MB231_ceIls, M&WDA-453 
CALU6.ceBs, MCF7, DU145^ceHs 
HT29.cefls, BT474_ceBs, HMEC 
DU145_cefls, A549_cefls, LMJ.CJW60 
PC3_cefls. 293Tjcefls, DU145_ceQs 
HS578TjceBs, MB231_ceBs, Lu_SQJi52Q 
Lu.SC JW5, DW45_cete, OVCARjceBs 
HS578T_cete, EHJ145_ceDs, LNCaP.cefls 
HMEC, WVEC (total RNAX EB.ceQs 
DU145_cefls, PC3j»Ds, PRSCJog 
EB^ceOs, HMEC (total RNA), LuJ.Cji460 

23 HT29_ceBs, RPWE^Z LuJU)Ji23 
HT29_cete, Lq_SCLH520. Lu_ADJi23 
EB_ceHs, CALU6_cens, A549.ceto 
23 LNCaP^ceDs, DU145LceBs, PRSCJog 
DU145_ceto, A549_cefls, MCF7 
MCF7, DU145_ceBs, EB.cefe 
Fibroblasts 2, M&WDA-435S, Lu JC JM60 
a.cefls, UJSCJ1345, Lu^SCJi69 
CALU6_celb, EB.oeOs, MCF7 
HSoTBT^ceBs, RPWE-2, Lu JD l358 
LNCaPjcefis, DU145_ceDs, OVCAR^cete 
DU145_cens, PRSCjcon, PRSCJog 
LitADJC3, Lu_SCJ*345, Ul^69 
DU145_cefls, PC3_cefe, LNCaP.cells 
BT474^cefls, BT474^cefJs, Ut.SCJi69 
EB.cete. 293T_cefc, MB-MDA-453 
LNCaP.cete, MCF7, PC3jaIb 
HT29_ceDs, M&M2A435S. HMEC (total RNA) 
EBjcefls, CALU6_cefls, 293T - cefls 
HS578T_c8fe, 293T_cete, A549_cells 
UlSC J«45, Lu_SC_H69. 293T_cete 
CALUe^cete, HMEC, UU\D_H23 
OVCAILcells, LicSQ.H69, MCF7 
HS578T_celb, CALU6^ceBs, Caco2 



96 



WO 02/068677 



PCT/US02/06001 



111658 R16981 Hs.15276 ESTs 228 
112322 F55T57 Ks26457 EST 23 
133477 W69310 Ks.740 FTO protein tyrosma Hnase 2 22B 
132149 T1Q822 Rs.4095 ESTs 228 
115119 AA256524 Hs.46847 HrnnanDHAseqtocbnB30AiQonGhromo5 

yeastand archaea bacterial genes; and 227 
102130 U15Q09 Hs.1575 small rmdear ribonucteaprolB&i D3 polyp 227 
114343 Z41424 Hs21259 ESTs 227 
106746 AA476436 HsJ991 ESTs 227 
119359 T71Q21 Hs33334 ESTs; Hlghfy simfer to WS bask>Mx-l 227 
105301 AA435867 Hs. 168212 Knesin ferriy member 3B 227 
130280 L13738 Hs.153937 acfivated p21cdo42Hs kinase 227 
119724 W69468 Hs.47622 ESTs 227 
108960 AA150199 Hs.49378 DKF2P586D0919 protein 227 
103489 Y08614 HsJ9Q90 exparfin 1 (CRM1; yeast homobg) 226 
107711 AA015736 H&220687 ESTs 226 
131550 W84704 H&35380 ESTs 226 
107093 AA6096Q0 Hs.10018 ESTs 226 
113649 795641 Hs. 16400 ESTs; Weakly surfer to Hra [H.sap iens] 226 
105255 AA227498 Hs3623 ESTs 226 
130094 H43286 Hs.1670f7gamn^amInotH^aca(GABA)Biecepto 226 
111874 R37S59 Ha. 13353 ESTs 226 
107890 AA026O30 Hs£1311 ESTs; WeakJy similar to CALPAiN Z LARGE 226 
124628 N747Q2 Hs.102834 ESTs 226 
119707 W57569 Hs.44143 ESTs; Weakty simSarto SNF2a!pha protel 226 

106737 AA470080 Ha36237 ESTs; Moderate^ simlfer to prote 226 

117305 N22798 Hs.43248 EST 226 

134470 X54942 Hs£3758 CDC28 protein Kinase 2 226 

130734 T99337 HS.1B624 WAA10S2 proton 226 

128561 R69227 Hs.101489 ESTs 226 

100670 HG2992-H Beta-MexDsamiiifcfase, Alpha Pcfypepfcte, 226 

115953 AA443958 H&90960 ESTs 226 

129612 H17476 Hs.11615 ESTs; Hfchty similar to map kinase phosp 225 

111362 N91973 Hs.23595 deoxyribcnuciease III; dnaQ/mutD (E cd 225 

116275 AM85453 Ks250911 Intofteukin 13 receptor; alpha 1 225 

114461 AA024848 fe.126705 ESTs 225 

134083 AA278393 Hs.79013 ESTs 225 

132470 224724 Hs.4934 KsaplenspolyAsiteDNA 225 
114718 AA131328 2n8d1 ^1 Stratagene neuroepithefium NT2R 

SVWX)X3LMOUSE P45 CYTOCHROME C OXIDASE P 

129499 R40395 Hs242908 fedtfurniraiestemi acyftransferase 225 

124758 R38422 Hs.169168 ESTs 225 

130301 X83127 Hs.172471 potassium voltage-gated channel; shaker- 225 

131263 R38334 Hs24950 regulator of G-protein sfcnallhg 5 225 

107159 AA621340 Hs.10600 ESTs; Weakly stmBar to ORF YKR081 c (S.c 225 

133262 N72009 Hs2D6710 ESTs 224 

132985 AA093619 Hs.62113 WAA0717proteto 224 

114172 239043 Hs21421 ESTs; Weakly sfrnfer to cysteine desufiu 224 

127847 AA913387 Hs.126717 ESTs 224 

106499 AA452244 Hs.16727 ESTs 224 

105(85 AA150088 Hs27023 WAA0917 protein 224 

108876 AA134361 Hs.191453 ESTs 224 

121971 AM29667 Hs.120405 ESTs 224 

114334 Z41342 Hs22941 ESTs 224 

114565 AA053001 Hs. 103527 SH2 domain protein 2A 224 

115766 AA421761 Hs.77603 ESTs 224 

130989 AA608546 Hs21906 ESTs 224 

116304 AA489461 HsOT42 H sapiens mRNA for WAA054O protein; par 224 

111154 N66545 Hs29169 ESTs 224 

105561 AA262881 Hs.16029 ESTs; Weakly sbnSar to afiemafivety sp 223 

105939 AA404421 Hs.12258 ESTs 223 

126379 AI085342 Hs/166146 Homer; neuronal kraneifiateearfy gens; 3 223 

106610 AA458882 Hs.4832 ESTs; Moderately similar to Lasp-1 prate 223 

132786 AA424545 H&56851 H sapiens mRNA expressed In placenta 223 

107206 D20728 H&30767 ESTs 223 

133708 R42172 Ha.75667 synaptophyski 223 

135123 AA227567 H&9482 target of myb1 (chicken) homotog 223 

132156 AA157401 Hs.4113 S^enosyth()rnccysteine rr/rfrofese-Ste 1 223 

116934 H75624 Hs«3S662 ESTs 223 
133660 R87373 ym88e05Jl Scares aduftbiainN2b4HB55Y 

IMA6&166016 5\ mRNA seo, 223 

119468 W23633 Hs.125043 ESTs 223 

101247 133801 Hs.78802 glycogen synftase kinase 3 beta 223 



125008 AA253460 zsOBfiM^I NCLCGAP_GC81 H sapiens cDNA 

1229OT AA477119 zu37c7^1 SaaiBSovaiytunwNbHOTHsap 

TFfcG288289 G288289 MTTOCHO^RIALD-LOOP 



MMfflA-435s, 293T_cefa, A549_ceJfe 
Ui.SCJ^ ULSCJJ69, UlAP J58 
EB.cete, PC3_cefc, DU145_oj& 
LMCaPjcete, BjceQs. PC3_ceQs 

A549_cefe > EBcefis, LNCaP_ce& 
LMCaPjsSs, Caco2, BLcete 
HT29_cefls, OVCAR_ceDs, Fftn*fests2 
UlAD_358> RPWEJ2, ULADJC3 
U_SC_HS9, 293T.C6&, DU145_cefis 
OVCAR_ce8s, LNCaP.ceSs, EB_cefe 
M&WDA453, DU145Lcefa, DU145__cefls 
PC3_C8&, KT29jceHs, A549 jceDs 
EB_ceOs, HS57aT_ce&, LuJ\D_358 
EB.cefls, CALU6_ceQs, DU145_ceHs 
EB.cefe, LiLAQJi23, UlAD-358 
HS578T_ce0s, OVCAR^cefe, WBWDA^35s 
LMCaP.cefls, OVCARjcefis. DW45jeSs 
ULADJC3, UuSCJffiQ, PRSCJog 
HS57BT.ceDs. 293T_ce0s t UJSCJC45 
Rbrablaste2, MB231_ceb, 293T_cefls 
CALU6_ceDs, Lu_SCLH520, 293T_ceHs 
KT29^ceas, MB-MDA453, PC3_cefls 
293T_ceDs, CALUS^cefe, CALU6 cells 
293T_ceQs, OVCAR_cefis, Lu_SCJC45 
LNCaP.cefls, DU145^cefls, h®ACA-435s 
HT29jce&s, BT474_cefis > Ffcratiasts2 
DU145_ceDs, CALU6_peDs, l^lCaP^ceDs 
UlAD_H23, UuSC Ji345. liLSCJ«9 - . 
Lu_SCJO«, DU145^ce&, OVCARjceDs 
KT29_cefls, BT474_cefc, UlSO-H345 
Caco2, 293T_ceDs. DU145_ceb 
CALU6jceBs, LNCaP_ce&, PC3_ceSs 
Lu_SQ_H520, LuJ\D_H23, RPW^_2 
OVCAFLceDs, 2S3T_cefls, DU145_ce0s 
EB^cefls, hiJDJUX UUDJVQ 
293T_cefe, EBjceDs* OVCARjceDs 
EB.ceBs, HS578T_ce2s» Caco2 

225 MB-MDA435s, HT29_ceDs, UlSC Ji69 
HMEC (total RMA), Fibroblasts Z HMEC 
293Tjcefe, RPWEJ, LuJ.C_H460 
EB_ceDs, OVCAR^cete, A549_ce5s 
UlAD_H23. ffi.ceb, Lu^SC Ji69 
LMCaP.ceHs, HMEC, EBjceBs 
LilSCJ^ DU145^cefis, INCaP.cefe 
EB_ceOs, hiJDjm, UlAD L358 
293T_C8fls, CAUUejoeHs, l_u_SCLH520 
LNCaP.cefls, DU145joe&s, Lu_SC^H69 
U_SC_H345, M&MDM53, Lu_SC_H69 
DU145_cefe. LNCaP.cefe, CALU6jos&5 
EBjceDs, LuJSC J045. LuJVD _H23 
UlADJTO. 293FT^ce8s, CALU6^cete 
OU145jceSs, PC3_ce0s, EB_ceSs 
Lu_LCJM60, IWCF7, HNECfWalRNA) 
F2rottasts2, mtoDA435s. MB231jxBs 
PC3^ceDs, LNCaP.ceb, DU145^ceDs 
BT474„cefe, EB_ce&, LNCaP_cefls 
OVCARjcefis, MB-MDA-435S, HMEC 
HS578Tjce85. A549_cefls, HMEC 
EB.cefis, LNCaP_cefe, DU145_cefc 
HS578L.ceils, PC3_cefe, RPW^ 
DU145_ce&, MCF7, UlSC_H345 
BB.ceDs, UlAD_H23, Rbrab)asts2 
BT474_cefls, Rbrobtests2 MB^DM35s 
UlSC JW5, CALU6_cens, lu.SC J«69 
BT474_ceHs, MB231_ceDs, SjceDs 
DU145.cefis, 293Tjce2s, LHCaP_ce& 
CALUB^cens, Lu^SC_H345, IhJjCJW60 

DU145_cefls I A549_ceDs, PC3jcete 
293T_cefe, M&WDM53, OVCAR^cefe 
LNCaP.cefe, S_cefe. MB-MDA435s 
223 HT29_ceSs, PRSCJog, FfcraWasts2 

223 PC3_cens, MCF7, MB^DA435s 
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114148 Z38804 Hs.184777 ESTs; Moderate similar to OPIOID BIND! 

MOLECULE PRECURSOR [H^apfens] 223 

103433 X98001 Hs.78948 Rab geranytgerajiyttransfejase: beta subu 222 

132954 AA027112 Hs216194 ESTs 122 

133228 N90Q29 Hs.6831 H sapiens done 1400 unknown protein mRN 222 

103891 AA242B87 Hs.124186 ring finger protein 2 222 

124883 R75630 Hs.177242 ESTs 122 

109921 H05734 HsJ0559 ESTs 222 

127306 A1305162 Hs.193687 ESTs 222 

102707 U77456 Hs.78103 riucJeosame assembly protein 1-fite 4 222 

108193 AM27625 Hs23272 ESTs 222 

118819 N79045 H&50800 ESTs; Weakly similar to till ALU SUBFAMI 222 

134326 U16306 Hsi1800 chondral sulfate proteoglycan 2 (vera 222 

112241 R51248 Hs.160Z7 ESTs 222 

123693 AA609591 Hs.112728 ESTs 222 

123052 AA496297 Hs.182740 rfoosorrdrjrofetoS11 222 

122481 AA448271 Hs3126 ESTs 221 

128895 R37753 Hs.106985 ESTs 221 

124691 R05835 Hs.1 10153 ESTs; WeaHy simfer to BCB1 GROWTH FA 

131556 AA442853 Hs2869 cydirniependefit kinase 5; regulatory so 221 

128869 AA424570 Hs.106736 ESTs 221 

107114 AA610089 Hs.1 1776 U4fli6^ssodatedRNAspBcingJactor 221 

106255 AA431191 Hs.161489 ESTs 221 

130724 AA37C091 Hs.179680 ESTs 22 

105483 AA255874 Hs23458 ESTs 22 

118970 N93503 Hs£4961 stoned Bm^nA-alphaibeMke factor 22 

120805 AA346041 Hs£6844 ESTs 22 

106158 AA425382 Ks.6553 ESTs 22 

102121 U14391 HSw82251 myosinic 22 

109446 AA232125 H&87062 ESTs 22 

129515 AA490882 Hs.1 12227 ESTs 22 

113128 T49325 Hs.8977 ESTs 22 

127269 AI041014 Hs220752 ESTs 22 

129912 AA047344 Hs.107213 ESTs; Highly simaar to NY-REN-6 antfejen 22 

115700 AA411685 H&67709 ESTs * 22 

106267 AA431873 Hs.4988 H sapiens done 24711 mRNA seq 22 

112881 TD3593 Hs.182814 ESTs 119 
116902 H70739 yu69f1U1 Weizmann Olfactory EpitheBum 

IMA6E239085 3' sfmQar to contains LTR 119 

105621 AA280865 Hs.6375 Hsap!erarT^cD^OKFZp564KO2220rom 

126991 R31652 Hs£21 bigrycan 119 

125466 R08234 Hs.180461 ESTs 119 
108491 AA082973 zn7g1.s1 Stratagene hNT neuron (§937233) 

togtxM3672 6S RIBOSOMAL PROTEIN L7A (H 

109978 H09356 Hs.22528 ESTs 119 

106990 AA521354 Hs24758 ESTs .119 

122362 AA443919 Hs.95840 ESTs 119 

125367 AI016490 H&81964 SEC24 (S. cerevsiae) related gene rami 119 

110716 H97188 Hs35Q96 ESTs 119 

129297 R11267 Hs.1 80570 H sapiens chromosome 19; cosmid F22329 119 

104992 AA102652 Hs22753 ESTs; WeaHy similar to coded for by C. 118 

119898 W84738 Hs.137319 ESTs 118 

118594 N69022 Hs.49599 ESTs 118 

129786 H98977 Hs246109 ESTs 116 

104325 D81608 Hs.150675 polymerase (RNA) U (DNA directed) polyp 118 



123022 AA480909 aa28f10.s1 NCLCGAP_GC81 H sapiens cDNA 



Atorepeti^elernerrt^xmtatoseJeniert 118 

133572 W94333 Hs.7499 transferase of tanermitDcr^^ 118 

133363 AA479713 Hs.71962 ESTs 116 

135361 AA053319 Hs.167700 ESTs 118 
128319 AA808904 Hs.1 15095 ESTs; WeaWy stmBarto RHO-RELATED GTP- 
DU145_ce0s 

128660 AA011597 Hs.1 77398 ESTs 118 

114877 AA235618 Hs205125 ESTs 118 

125925 H28737 ESTs; Moderately similar to Ml ALU SUB 11B 

113427 T85105 Hs.15471 ESTs 118 

117500 N31909 Hs.44278 ESTs 118 

131384 F13608 Hs.26226 ESTs 118 

134499 070370 H&84136 palrecMte ruwieodornain transcripfionfa 118 

128154 AA922969 Hs.127100 ESTs 117 

134585 T48154 Hs.168655 H sapiens n^fwH^bmcfirigfactor-2 117 

104987 AA101723 Hs.16683 ESTs 117 

132992 AA091017 Hs.6226 ESTs 117 

135311 M36089 H&98493 X-ray repair (^emerrting defective rep 117 

113171 T54613 HsS761 EST 117 

117736 N46999 Hs.46648 ESTs 116 



HS578Tj08&s, Fawobbsts 2, Lu_SCJ*345 
LNCaPjoefls, EB_ceOs, 293Tjcete 
EB_cefis, UUD _H23, Fibroblasts 2 
293Tjce0s f PC3_ceto, DU145_cefl3 
EBjceCs, UUSCJi69. UlSC_H345 
EB^ceUs, LilAO JC3, U_SC JB45 
LH.SQJI520, 293T_cefls, RPWEJ2 
MCF7, HT29.C6IS, MB-MDA-453 
Caco2, EB.ceDs, CALU6„cefls 
293T jceOs, EB jcefls. A549_cefc 
Lu_SCJi345, UlSCJ«9, DU145_cens 
HS578T ceSs, PRSCJog, CALU6_ceHs 
293T eels, HMEC (total RNA), HMEC (total RNA) 
HT29_ceHs, HMEC (total RNA), BT474jsDs 
EB.ceQs, UlAD_H23. UtAD.358 
LilAD J123, HT29.celb, LilAD_358 
EBjdbHs, Lu_AD_H23, UlSCJ*345 
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IJLADJC3, Lu_SC_H69, UlSCJC45 
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DU145^cefi5, LNCaPjceBa, OVCAILceDs 
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293T -> cefls. Caco2, U.SC J«9 
U_SC_H345, DU145_cefis, MCF7 
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A549jceb, KT29_cefls, m_SQJ4520 
293TjDefls, HS578Tjceb, PRSCjoon 
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DU145_ce0s, EBjceUs, HS578TjceBs 
HT29_cefis, BT474_ce3s, Lu_ADJt23 
HMEC, HMEC (total RNA), EfiLcelis 
CacoZ HT29_cefls. LNCaP.ceb 
EBjcefe, LilAD_H23, Lu_SC-H69 
U.SC J<345, Uijtf) jm t Luj^)_358 
HS57oTj»Ils, 293T_cefis, OVCAR_oeBs 

LlLAD_H23, EB_cete, PRSCjcon 
EB_oell3. CacoZ PRSCjcon 
293TjceDs, U_SC_H345, CALU6_cefe ^ 
HS57&T_cefls, PRSC_con, OVCARjcelb 
RPW^Z HMEC (total RNA), HMEC 
Fibroblasts Z OVCAR^cete, 293T_ceto 
F&roWastsZ MB^DA-453, PRSC_con 
E0 cells, LNCaP.cefls, LNCaP.ceBs 
D_fiDJV3, LtL.SC_H69, Lu_SC_H345 
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EB.ceDs, Lu_SC_H69. 293T_cefts 
DU145„cefls, BT474_ceIls, LiLSCJffi9 
LilSC Ji69, Lu^SC _H345. PC3 _cefe 
MB-MDA-435S, Lu_SQJi520, lffi-MDA-453 
HS579T„cetl3, 293TjceQs, OVCAR_cefls 
EGUcefis, 293T_cete, PRSCjcon 
ULADLH23, Lu_SCJi345, PRSCJog 
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122529 AA44S328 
104612 R99199 
116465 AA621650 
123155 AA488414 



125752 AKJ73373 
126455 N80749 
123339 R77869 
115021 AA252Q28 
129054 TB7231 
101261 L35545 
132697 AA281951 
124380 N26536 
103967 AA303711 
119403 T92935 
125755 R66080 
101843 M93405 
113032 T24024 
112563 R72632 
126432 AA583825 
101638 M57763 
125174 W51835 
1061® AA425943 
135343 AA236796 
105267 AA227956 
134331 AA452Q2Q 
121634 AA417012 
131394 R72637 
111526 R08260 
125049 T79840 
120433 AA237077 
13498 AA449789 
127805 AA740921 
109275 M1962B7 
120683 AA290987 
135415 X60655 
132925 AA252759 
101875 M97287 
101463 M22490 
129177 T95005 
130725 W88946 
105549 AA262417 
124543 N63706 
123062 AA482069 
109464 AA232857 
129619 AA610116 
127545 AA935809 
133068 R73427 
113609 T93263 
106645 AA460270 
126256 Z21124 
129697 R00841 
126730 T19477 
125244 W86466 
134762 M91036 
119564 W38206 
132523 AB002332 
127758 AJ337031 
126471 AA158755 
110911 N45120 
122317 AA442742 
100253 D38024 
120431 AA236884 
122449 AA447638 
100961 J00148 
130908 W86389 
102643 U67849 
127932 AA398510 
109207 AA19C9C6 
102598 U62962 
124470 N51702 
104961 AA076672 
124164 H30667 
126468 AA242853 
129683 WD5348 
105350 AA235737 



2X6 
2X6 
2X6 

2X6 
2X6 
2X6 
206 
2X6 
2X6 
2X6 



K&99229 ESTs 

Hs.173063 trajBdx±i4ite enhancer of spQ 2; hom 
Hs.41045 ESTs; WeaJdy similar to WAA0734 protein 
Hs.76127 hect{hooioSo^toflieE&AP(UBE3A)ca 

<tomah{RlD)1 
Hs.183275 ESTs 

Hs.1 11515 ESTs; Weakly sirn2ar to predicted using 
Hs 28506 ESTs 
Hs39168 ESTs 

Ks.168289 succinate dehydrogenase complex; subunit 
H&82353 encbMaiceDprotehC^divatedpro 
H&551B H sapiens mRNA; cDNA DKFZp5^J2146 (from 
H&84999 ATPase; Cih+ transporting: beta pdypept 2X6 
Hs.144700 ephrto-BI 2X6 
Hs.119908 ESTs; Highly s&nDartDraictecbrprotel 2X6 
Hs.191268 H sapiens mRNA; cDNA DKFZp434N174 (from 
Hs.170008 roethytn^oraf&^emiaJderr^ 
Hs7387 DKFZP564B116 protein 2X5 
H&2S282 ESTs 2X5 
H&235B60 ESTs 2X5 
H&89474 ADP-rfcosyte6on factor 6 205 
H&231082 EST 2X5 
HsX2208 acyW^enzymeAderryrf^^ 2X5 
H&9914 foffistaft) 2.05 
Hs£5348 foffistafrv^J© 3 (secreted gfycoprotei Z05 
H&234156 ESTs; Weakly sfmilar to C6M28 protein 2X5 
H&28921 ESTs 2X5 
Hs26343 ESTs 2X5 
H&20131 ESTs 205 
Hs.1 11798 ESTs 2X5 
Hs.180777 Hsap^mRf^cDNADKFZp564M0^(n^ 
Hs.75511 connecfive fissile growth factor 2X5 
Hs.1 197 heet shock 10HD protein 1 (chaperonii 10 2X5 
H&20303 ESTs; Moderately sim2ar to Ml ALU SUB 
Hs.49657 ESTs; WeaJdyamSar to contains similar 
HsX9967 evejKk^homeob(sc1{h^ 
H&238296 DKFZP434A033 proteh 
Hs.74592 special AT-rich seq binding protein 1 (b 
Hs.68879 bor^rrorphogenefic protein 4 
H&2Q9587 ESTs 

Hs.18508 putative gfyche^^acyflransferase 
H&5415 ESTs 
Hs.104573 ESTs 
Hs.100847 ESTs 
H&87100 ESTs 

Hs.1 1663 tetraspanNET-6 protein 
Hs.1 15899 ESTs 
K&235712 ESTs 

Hs.16875 ' ESTs; WeaWy similar to hypothetical pro 
H&27695 midline 1 (OpSz/BBB syndrome) 

HSAAADNVETEST1, Human ariuft Teste fcs2X4 
Hs.172069 DKFZP434G212 protein 2X4 
A1426R Heart H sapiens cDNA done A1426, 2X4 
Hs.132756 ESTs; WeaWy similar to, KIAA0591 protein 204 
H&242985 hemoglobin; gamma G 

Accession not Dsted to Genbank 
Hs.50722 dock (mouse) homolog 
Hs.180195 ESTs 

Hs.175652 ESTs; WeaWy simto to llfl ALU SUBFAMJ 
H&22305 ESTs 

H&8693 ESTs; WeaWy simfar to Iffl ALU SUBFAM1 
Hs-247951 Himifariosoaputohurr^m 
H&247323 HsapfensmRNA for G4 protein (G4 gene; 
Hs.104977 ESTs 

Accession not listed to Genbank 
Hsu21122 ESTs; Moo^ratery similar to WAA0438[H. 

Hummbel»9alttiQsiteatiha« l &da^ 
Hs.133148 ESTs 
H&2046g ESTs 

Hs.106673 eukaryofc translation irr3aficn factor 
Hs.101392 ESTs 
Ks33905 ESTs 

Hs.7535 ESTs; Highly similar to COBWtfloe ptecen 
Hs.237856 ESTs;MateaterysirnfertocAKP InrJucfl) 
Hs.158196 DKFZP434B103 protein 
Hs.186571 ATPase; Na^+trartsprjr&ig; alpha 3 pd 



2X5 
2X4 
2X4 
2X4 
204 
204 
204 
204 
204 
2X4 
204 
204 
204 
2X4 
204 
204 
2X4 



2X4 
2X4 
2X4 
2X4 
2X4 
2X3 
2X3 
203 
2X3 
203 
2X3 
2X3 
2X3 
2X3 
203 
2X3 
2X3 
2X3 
2X3 
2X3 
2X3 
2X3 



Dtf145_ceB3, HS578T_cett3» 293Lce0s 
MWffiM^s, 293T_cefe, 293T_ce0s 
MB231_ce2s, Hra_ceDs, LuJU>_33 

DOl45_ce5s, CALlffiLceSs, PC3_ce0s 

LNCaP.cefis, EB_cefe, DU145_ce2s 

CALU6_ceIs. PRSCJog, OVCAfLceUs 

EBjaDs, BT474_ceDs, UlADJE3 

UlSQ_H520, Ftorobbsts2, S_cefe 

Caco2, LNCaP_ce&, EB_ceHs 

EBjsQs, RPWEJ. DU145_ce& 

2X6 OVCAfLcefc, WCaPjaDs, DU145_ceQs 

Caco2, Caco2, 293T cefls 

H129jcelh, HMEC (total RNA), HMEC 

HMEC. Sjcefls. HMEC {total RNA) 

2XS IKCaPjceBs, DU145_ceBs, OVCAFLceQs 

LNCaP_cefls t Nffi-MDA-453, S.cefe 

EBjsQs, A549_ceHs, A549_cefa 

MCF7, re578T.cels, PRSC_con 

MB231_cels, HT29_ceSs, nbrobJasts2 

DU145^oe&, LNCaP^cefe, PC3_ce2s 

EB.cete, Rbrob)asts2, LuJ\D _H23 

OVCAR.cefc, PC3_cefe f EB ceb 

HMECftofalRNAX PC3jcefls. HNEC 

HMEC, RPW^ HMEC (total RNA) 

EB.ceSs, CALU6_cefe, A549_ceDs 

HS578T_cefls, EBjsto, UlSCJI345 

EBjceOs. UlSC_H69, UlAD_H23 

ULADJC3, Lu_SC_H69, BT474_cefe „ " 

HT29jcefc l Lu^ADJi23. Lu_SCJi345 

2X5 DU145_ce0s, CALJUS.cefts, PC3^ce0s 

HS578T_cete, PRSCJog, PRSCjcon 

DU145_ce0s, LNCaf^cefe, OVCAFLceQs 

S_cefc, MB-MDA-453, RbrofaIaste2 

ULADJ358, ULSCLH520, LuJjC_H460 

UlAD _H23. RPWEJ. U.SCLH520 

293T_cefls, HS578T^ceSs, LMCaP.cels 

EB_ceOs, Lu_SC_H69. 293T_cefc 

PRSC.con, HT29_ceHs, MB231jcells 

293T_cete, MB^lDA^35s. Lu_SCJ«9 

HT29.cefis, Fibroblasts 2, MB-MQA435s 

DU145_cells, OVCAR_ceDs, PC3_ce& 

Caco2, 293T_ceCs, DU145_cefis 

UU\D_358, HT29_ceDs, HT29_cefc 

DU145_cefc, LuJVD_H23, LNCaP eels 

BT474_ceDs, Caco2, LNCaPjceSs 

BT474_cefls, M^MDA-435s, MB-MDA-453 
Caco2, OVCAFLcefe, MCF7 
SLcefe, UlSC_H345, PRSC.con 
A549_ceBs 1 293T_cefl3, Caco2 
Fibroblasts 2, FaxDbfasts2, MCF7 
HTZ9jce0s, Lu_SQ_H520, BT474_cell3 
EB.ceOs, LlAD LH23. Lu^.SCJ«9 
EB__ceCs, UlADJ<23, Lu_LC _H460 
MB231_cefis, LilAD.358, HT29 ceBs ^_ 
BT474.C8BS. HT29_cete, LlAD JUS 
PC3.cefls, OVCAFLcefe, PRSCJog 
293T_cefc, M&MD/V435S, A549 ceils 
EB_ce&, Lu_AD„358, Lu_J.C _H460 
UlAD JC3, RPW^2, Lu„LC_H4«) 
EBjoeQs, Fibroblasts 2, LlSC_H345 
LujW)JI23, UlAD_358, Lu_SCLH520 
Lu_XJi69, EB.ceHs, Ul.SCJ»« 
LLSCJB45, Lu_SCJi345, Lu.SQJi520 
HT29.ce0s, BT474.ceDs, HMEC 
293TjceDs, UL.SCJC45, OVCAR_cefc 
HTajceBs, 293Lcefis, UlSC JQ45 
EB_ce&, LuJSC JJ345, UlSCJ^69 
ULSCU620, ULSCJ045, liLSCJi69 
EBjceSs, DU145.C8QS. MCF7 
HT29_ceUs, Fibroblasts 2, HMEC (total RNA) 
Caco2, LNCaP^ceHs, EB.ceOs 
CAU^.ceife, CAUUe.ceQs, A549.ceBs 
MB23l_ce0s, BT474_ceDs, Fibrobfesis2 
HT29.celb, h©M}M35s, UiJD Ji23 
fcB-MDA-453, Lu_SCLH520, UlAD 358 
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129794 AA447772 H&14520 eukaryofc translation tnMon factor 2.03 

115664 AA405974 Hs£4673 tumor necrosis fector (ngand) supertax 2.03 

119096 R41672 H&91471 ATPase type IV; phospholipid transpwfin 2.03 

133866 L38151 Hs. 171 625 phosphatidynnositol44dnas8;ca!aI^ 203 

132055 N63440 H&38132 ESTs 2D3 

125691 AI034361 Hs.135150 lungtype^ceOrrambiane^sscdatedgly 203 

121376 AA4Q5699 Hs.166232 ESTs; Moderately similar to SODIUM- AND 

TRANSPORTER 2 [Ksaptens] 2.03 

105289 AA233178 Hs.103000 WAAD831 proteh Z02 

100967 J02621 Hs^51064 h!g>mob®y group (nonhistone chromoso 2.02 

124430 N38913 H&221575 ESTs 2D2 

128322 AG06331 Hs.133296 ESTs 2M 

131077 X91B09 H&22698 G alpha toteracfing proteh 2.02 

108033 AA040923 H&92200 WAA0480 gene product 2.02 

107350 AA0O1O45 Hs.46783 ESTs 2.02 

109475 AA233159 H&87t31 ESTs ZQ2 

111400 R00144 Hs.189771 ESTs 2.02 

117516 N32495 Hs.151560 ESTs 2jQ2 

120506 AA257955 Hs.173705 ESTs; WeaWy simfar to Iffl ALU CLASS C 202 

130850 N39306 H&20237 DKFZP566C1 34 protein 202 

123118 AA486571 Hs.105696 ESTs; Moderately similar to 119 ALU SUB 202 

111285 N71704 Hs.4310 eukaryottc transbfon fnifeton factor 202 

119106 R42362 Hs.91785 ESTs 202 

111370 N92915 Hs.94631 brefekfin A-inhibaed guanine nudeoflde 202 

155013 767261 Hs.154431 ESTs; WeaWy similar to neuronal thread 202 

129762 AA460273 Hs.12372 WAA0517 protein 202 

120704 AA291970 Hs.107054 K1AA0821 prateai 201 

105355 AA235985 H&26938 Human DNA seq from done 126A5 on chromo 

genes (one with DnaJ domains); the gene 

fam*y member HKR3. Contains ESTs; STSs; 201 

125952 AA017723 smaD inducible cytokine A5 (RANTES) 201 

103478 Y07755 H&38991 S100 cakaurrvbhcflng protein A2 201 

133544 T33873 Hs.74624 protein tyrosine phosphatase; receptort 201 
112746 R93237 yq11e10.s1 Scares fetal Iver spleen 1NF 

IMAGE: 196650 ?, mRNAseq. 201 

.118513 N67504 Hs.40061 ESTs 201 

123423 AA598484 H&238476 EST 201 
127854 AA769520 ESTs; Weakly similar to REGULATOR OF MFT 
LiLSQJfi20 

111843 R36969 Hs.18888 ESTs 201 

100221 D28383 Human mRNA for ATP synthase B chain, S\J 201 

129966 AA452237 Hs.194443 ESTs; Weakty slmfer to BC37295^2 [H.sap 2.01 

106798 AA478968 Hs.20558 ESTs 201 
114636 AA085374 zn13d&s1 Strafagene hMT neuron (8937233 

gfcL8441 CYTOCHROME C OXIDASE POLYPEPTl 

12348 H21585 Hs.191277 ESTs; Moderately similar to ATP binding Z01 

130620 AA233245 Hs. 16773 ESTs 201 

106471 AA450118 H&25722 ESTs; WeaJdy similar to ZINC FINGER PROT 201 

134175 T33128 Hs.7966 ESTs 2 

117291 N22289 yw36gOEs1 Morton Fetal Cochlea H sapien 2 

134199 U47635 Hs.79877 myotubularin rented protein 6 2 

128758 AA129545 Hs.181165 eukaryolic tenslafion etongafion fector 2 

112005 R42569 K&22444 ESTs 2 

122521 AA449433 Hs, 149227 ESTs; WeaWy slmfer to PROUNE-R1CH PRO 2 

130356 X84373 Hs.155017 nuctear receptor tiiteracfrg protein 1 2 

114067 Z38153 H&26921 ESTs 2 

107136 AA620795 H&8207 ESTs 2 



EBjJeDs, \MJD J58, UlADJ*23 
UlAD_358, HT29jsfls, HT29_ce& 
HT29jceDa, MB231 jefis. BT474^ceDs 
293Tj»b l 011145.0* IflCaPjcete 
Lu_SCJC45, MB-MDA-453, M&-MDA-4353 
Lu_SCJB45. LNCaPjceDs, DU145_cefc 

LNCaPjoeOs, HT29_cefe, RPWEJ? 
PC3_ceDs, ULADJC3, MB231jcefe 
MCF7, DU145jceto, OVGARjoefe 
MB-MDA435S, Fibroblasts 2, EB.ceHs 
HT29_ce&, MB-MDA4353, Lu_SC_H345 
UlAD_H23, RPWIL2. MCF7 
MCF7, Fibroblasts 2, DU145_ce!ls 
DU145_cefls, PC3jDe0s, QVCAR.cetts 
HT29jcete, MM4DM35S, LtLSCJ*69 
HT29_cefls, Fibroblasts 2. HMEC 
HT29_cefe, HMEC (total RNA), FDroWasts2 
MCF7, Fibroblast LNCaPjsfis 
EB.cefis, LiJDjm, UUC_H460 
CALU6_cefc, 293Tcefls. PRSCJog 
293T_cefls, PC3 cells, EB ceQs 
CALU6_ceBs, M&MDA-453, PC3jsfe 
EB_cefls, OVCAR.ceDs, LNCaPjoete 
Lu_SCji345. Lu_SC_H69, PRSC_con 
EB.celfe, MB-MD*435s. OVCARjaOs 
LilSCJ«9, EBLceBs, MB-MDA-453 



UiJ&JUZ, UUCJM60, LilSCLH520 
UCaPjceBs. DU145_ceJis, MB231_cefe 
HMEC (total RNA), HMEC. RPWEJ2 
Lu_SC_H345, BfT474_cefls, HT29_ceBs 

PC3jceDs, LflCaP_cefls, OVCAfLcefe 
Lu.SC JMS, Ui_SCJ«9 t PRSC.con 
EB.ceSs, bxJD JI23. UlSC_H345 
2.01 HS578T_cete, CALU6.celb, 

UiJ«3 lH23, LuJ\D_358. LUJ5Q.H520 
EB_cefls, la JD_W, LNCaP.ceils 
ULSCJ1345, Ul.SCJ«9, DU145„cefis 
EB.ceto, ULADJ123, LuJXJM60 

201 EB_ceb, CALU6_ce8s, OVCAFLcefls 
EBjcefs, HS678T_cefls, PC3_ce8s 
EBjcefia, DU145_ceb, 293T_ceBs 
OVCAFLceDs, LNCaP_cefls. EBjcete 
LU_SC_H345, Fftrobbst32, luJDJW 
MBM)A453, OVCAR_ce0s, CALU6_ceiIs 
EB.ceUs, PC3_ceD3, LNCaPjoeDs 
U_SC_H69, EB.cefc. Lu_SC_H345 
ULADJE3, PRSCJog, LuJU>J68 
HT2SLceto, RPWEJ. MB231_cels 
DU145^cefis, PC3^ceDs, MCF7 - 
293T_ceils, MB^DA435s, HT29.cefls 
LNCaP.ceb. PC3jceDs, fficeUs 
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Table 3 



BxAcco: 
UmgenelB: 
Uragmft Title 



Vmqat Eos probesct identifier number 



Umgene gene tiiic 



Ptey ExAccn UGJD CompfeteTTtte 

302347 AF03S400 Hs.194659 chtorida channel; calcium acfivated; fern 

316304 AI936587 Hsl221589 ESTs 

339196 CH22JT113D11.GB*SCAN3-1 

336171 CH22JhGB4ES.7Q8_3 

3388% CH22_DJ32M0.6ENXAN^2 

333625 CH2^J=GBIES-223_2 

333730 CH2£_FGENE&258 1 

320244 AA2S6922 Hs.129778 gastrointestinal peptide 

333643 CH22JGENES23ZJ> 

333423 CH22JGENES.147.3 

302332 AI833168 Hs.184507 H sapiens Chromosome 16 BAC dora C1T387 



Ratio BSJMet Top 3 expressing cell Ones 



CH2ajGENESJD6 - 2 

322033 AL137507 EST du^er (not In UniGme) 

3083)1 AI719930 EST singleton {not in UniGene) with exon 

339044 CH22J)A59H18.GENSCAN.27^ 

314516 AA371513 Hs231748 ESTs 

327805 CR05JisgI}5867968 

334239 CHELFOB<ES364_2 

332958 CH2£J=<3ENES.48 15 

313386 W85772 Hs.173924 ESTs 

314350 AL037927 Hs.190575 ESTs; Moderately sfctfar to ffll ALU SUB 



337170 
337503 
337562 
337219 

311331 AI679622 Hs32225 

314251 AA713589 

336246 

335009 

339365 



334966 
334666 



316830 AW182106 Ks.127821 ESTs 



CH22J=GENES364.1 
CH22_FGENES^03-1 
CH22LC65E1.(SNSCAN.1-2 
CH2^FGBES.61« 
fcronunogfobuin alpha 1 
ESTdusler(notmUniGene) 
CH22LR3ENES.746J 
CH2*_FGENES.47*J3 
CH22JA354I12.GENSCAN34-1 
CH2?J=GENES388J7 
CH22JSENES.465J6 
CH22J=GENE&418 18 



339413 
337951 
330153 
333987 
334304 
338990 
333152 
327049 
337225 



334451 
333594 



336796 
333313 
336833 
336090 
336645 
334565 
333242 
326304 
337445 
327413 
327990 

325038 H38304 
314923 AT732489 



334476 



CH22_DJ579N16.GB4SCAN.5-8 

CH2^BtAC005a».GENSCAN.94-1 

CR21_p2gi}4325335 

CH2?J=G9*ES310J1 

CH22J=GBIES373_7 

CH22J3A59H18.GB4SCAN5-6 

CH22JH3BJES.89 1 

CHJMJis g56531965 

CH2ZJGENES.62&-3 

CH22JH3ENES.168.6 

CH22J r GENES387 11 

CH22_FGBE&210JJ 

CK2LFGB*ESl228J 

CH22J=GBfES.17&6 

CH2*J=GBJES.138_5 

CH22.FGB4E&242-2 

CH22J=GBIES.6B9_2 

CH22JGB*ES26-1 

CHZLB3B*ES.405_5 

CH22_FGB*ES.111_6 

CK17JjspJ5867277 

CH2*/GB<ESJ69-4 

CH3*JisgpB67750 

CH36J*gi[5868218 
K&21782 ESTs 
Hs.136370 ESTs 

CH37J» gJ56381928 

CH22JGENES394_7 

CH2?JGB*ES389_0 

CH22JGBCSL305J 



19.71 EB, NCJ-H520, NCW23 

14.49 PRSC.con, RPWE-2, OVG^R 

1037 NCJ-H69, PRSC.con, NCI-H345 

9.45 NCm9,NCU4460 r NCI-H23 
931 PC3,BT474,OVCAR 
836 NCW469, PRSC.ccn, NO-K345 
832 N(m9.BT474 l N®WDA-231 . . „ . 
8J22 BT474,CALU6,DU145 
7.66 MCF7, NCmS9, LnCap 
737 HT29, MB-MDA-231, B5 

735 MB-MDA-231, KT29, MB-MDA-453 

7.46 HT29,OVCA*,bTT474 

7-35 PRSC.con, PRSCJog, NC W345 

633 PC3, DU145, DU145 

6.46 NOH59, NCW345, PRSCJog 

6.41 EB, OVCA-RCaco2 

6J28 NCJ-H69, NO-H345, PRSCron 

6.09 NOH520, MB-MDA-435s, MRWDA453 

6.04 NO-H69, PRSC.con, PRSCJog 

538 M&M3A-231.0VCAR, BT474 

534 OVCA-R.CALU6, EB 

537 UCap,CALU6,NOH69 

536 NCI-H345, PRSC_COn, RPWE-2 

533 HT29,Fi/B-MDA-453 t BT474 

5.45 NOH69. NCI-H345, PRSCJog 

5.43 Nd-H69,NOH23, NOH345 
5.41 PC3, EB.LnCap 

534 NOH69, NCI-H345, PRSCJog 
531 EB,S.NCW69 

5.25 PRSC_oon, NCI-H69. PRSCJog 

531 PRSC.con, Caco2, PRSCJog 

5.16 DU145, BT474. M&M0A331 

5.15 NO-H345, PRSCJog 

5.12 NCI-H345, PRSCjcon, PRSCJog 

5.08 NCJ-H69, NO-H345, PRSCJog 

5.01 NOH345,NC(4{6%PRSCcon 

5 PRSC.con, PRSCJog, NCI-H59 

436 M&-MDA-231, MB-MDA-453, P^MDA453 

436 OVCA-R t CALU8 1 NaH23 

435 PRSCJog, PRSC.ccn, NCW69 
439 fc&M)A435s, OVCA-R A549 

437 PRSC.con, NCW345. PRSCJog 
433 DU145, CALU6.EB 

431 NO-H69, NOH345, PRSCjcon 

4.79 RPWE-2, PRSCjcon, NCPH69 

4.78 OVCA-R,PC3,HT29 

4.78 NCI-H69, PRSCJog, PRSCjcon 

4.73 NCI-H69, NO+B45, PRSCJog 

4.72 NCW69.NCI-H345, PRSCJog 

4.7 NCW345, NCI-H69, PRSC.con 

4.7 NG-H69, PRSC_con, PRSCJog 

433 HT29, OVCA-R, DU145 

4.62 NCI-H345, PRSCJog, RPWE-2 

436 NO-H345, PRSCJog, PRSCjcon 
4.48 OVCA-R, S.DU145 

4.47 RPWE-2, NCI-H69, PRSCJog 

4.46 NCI-H69, PRSCJog, NCU&45 

4.44 PRSC_con, PRSCJog, RPWE-2 
4.43 PRSCjcon, M&MJA-231, HT29 
4.4 Hra,MB-MDA^31,NCW58 
4.4 OVCA«.BT474,A549 

438 OVCA-R, PC3.EB 

435 PRSCjcon, Caoo2, PRSCJog 

435 NCMI69, NCI-H345, PRSCJog 
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MVHhC 


PW99 FZ2PNPQ ATI 17 


AM 


RPWF-2.HT9Q OVCA4* 


337947 








337504 


OW99 ctcckiec fifl9_9 


A 


Mpiiril*; PROP mn PR^C tan 
riW*fw*K?i rrvOv__COJI, rnuy^jy 




OUOO PftFNCC949fl 




D1I14S HTM OVTJUR 


3380® 




A 99 


nvrnWI, rnOvJHAli WWTW'N 


91 o cod Ki^oeoc 




All 


ppq acAq RT17A 




OJJ99 E/5CMCC 997 9 


A\ 


nVPJL-R PQQP rra 1 nfian 
UVWvn, rrvOv—LAjii) uioap 


9H9A4C mi/ocq 
OUZMO Ml4*O0 


COT 

col 


A 97 


DO QP rm PROP fevi RPWP-9 




PU99 Cftcxteo ran 9 






330007 


01499 cftojcc cm in 


A 9*; 


MrHjJCTfl Pom9 MB MOfl iS3 
NUHkjxU, UoCOa WlO-mUnr'KM 


■JOfWcn V -IT TOO 

£0002 Y133Z3 


ns.i4ozyo ocsiTiiEgnn protease 


A 9«; 


MQ_Mn&-991 ntHAfi RT47A 


oU44oQ AA4303/3 


to I Sing Kaon (not tn unxienej witn exon 


A 99 


MPL9Cft kJ^ILMcn MPLH99 
NUKwOf NLrHTZJ 


327273 


OU 04 ha nflCOCTACC 


A OO 


NUHiby, NLHWO, rKoOJDon 


334540 


CHZcJ-bfcWto.4u3_5 


il 47 

4.17 


NCt-nby, NUK1340, rKou_J0g 




l^tCs^UCTlCO.'tZI^U 


A 47 


MPU4RQ MPLW9A*; RPWP-9 
NUHT03, MUHuytO, WtVC-t 


327827 


OU/VC /*nCfiC70CJl 

vyil.uo ns yifooo/ ifoo 


>l 47 

4.1/ 


UVO/V-K, NU-noV, UALUO 


333598 


OU99 pficwre -wo 9 


A -17 




329638 




4 4C 

4.1b 


pi i-t/ic MDimA_994 UT90 


307556 A1281651 


to i singieion (not in uniuend) wnn exon 


A 4C 


BT474, nIZy, UALJUO 


336836 


CH22JGENES^47-11 


4.15 


PRSC_C0n, NCK1345, NUHloy 


323187 AL12110) 


Hs240038 ESTs 


4.14 


Wf*\ 1MAB KID ftlTlA it^Ra DDUUC *> 

NCH1340, Mb^nLW-WOS, nrWt-Z 


33S397 


CH22JGETOS.823J2 


4.1 o 


Mr»l Ln>tC DQCO mm DDUJC 9 

Nm-n340, PHoO_Con, KrtfVt-Z 


325007 AA736429 


EST duster (not in UnEene) 


A 49 

4.13 


NGhto, PRoCLoon, NLfrott 


300199 AI304386 


Hs.150836 ESTs 


4.11 


NCht34o p PRSCjCOn, PRovJog 


335832 


CH22JGcflES.620_6 


4.08 


NCi-no9, NCm345, PRoLJOg 


312778 AI631655 


Hs.197919 ESTs 


4.07 


NUKioo, NGI-rCo, PKoO_COn 


323164 AA765301 


Ks.151858 ESTs 


4.06 


& l/x. lKjo ACVtfk UTIQ 

NQ-H23, A549, HTZ3 


315871 AW135312 Hs.117237 ESTs 


A AC 

4io 


MB-MUA-231, cd, MCr7 


337452 


CH22J6EWES.775-1 


4.02 


noon nn. odop i— — upluo4c 
PRSCjDOn, PRoCjOg, NUHu40 


335255 


CH22J=GB€S\521_1 


4.01 




335200 


CH22JGENES\508J 


4.01 


NCl-rHw, PRoCJog, PKoujcon 


336917 


CH22_PGB<ES3464 


i An 


PRSCjCOn, NCm34o, rKoojog 


336584 


CH22JH3E1IES.847J 


3.S8 


OBOC Inn DOC/" mn DDtA/C 9 

rKovJOg* rnoujeon, nrwe-z 


333382 


CH22J=6EflES.t*U1 


3.97 


CD AC An LTPM 

cB, A549, KTZ9 


3294% 


CRYJsgf[5868883 


3^7 


oT474 t rCo, nl£t 


336929 


CHZLPGBIESJ4M 


354 


fti/%i ucfi hoi Lrajc nopp inn 
NCRioS, Nu-n345, PR5C jog 


337238 


CH22JGENES\641-3 


3,92 


» ir>i | ln«ft » Itrt <|E DDDO Inn 

NCl-noy, Nu-H34o, PRSOJog 


333875 


CH2?J=GENES^91J1 


332 


PRSCjoon, RPWE-2, PRSCJog 


337069 


CH22J=GENES.448-2 


3.9 




332491 M24470 


Hs.1435 guanosta monophosphate reductase 


3.86 


OVCA-R, M9-Mt/Ar43oS, UALUO 


304623 AA521331 


EST singleton (not in UniGene) with exon 


3.86 


f\\t/*A. d ni Mile py*Q 

OVCA-R, DU145, PC3 ■ 


335348 


CH22JGB*ES\537_4 


3.85 


KTZ9, MS-MDA-Z31, rw 


334568 


CH22JGENES.405J 


3JJ5 


NUfno9, NCht34o, PKbUjog 


336924 


CH2?J=6ENES^47-9 


O Oil 

3.84 


ur»i mac Doer Un ddwc o 
NOwu4o, PRSCJog, KrWt-Z 


301654 H81795 


EST 




M/Hi4G9A 1 nPan MPL9f>A 


334677 


CH2?J=GENES.418J0 


3.B3 


nocr mn Mot ui/ic mpluco 
rKov/jCOn, HUHwO, NUHTOif 


326688 


CR2D _hsg^867582 


J.OO 


ijpiLlO^C QDO/* mn POOP Irm 

NUhrkWO, rr<oU_Cun, rnovJOg 


327790 


CR05_hsg^5867977 


3.0 


ODCP nnn DDCP Wi UOIUOJC 

PRoujoon, PRoojog, nuhi34o 


334591 


CH22_FGENES.408_1 


3.0 


NvKW9, PRovJOg, Nv/HWW 


337974 


CH2?^^COC55C0.GENSCAN.10&^ 


O 70 

3.70 


BOOr* Inn BDCT nnn MCLUQAR 

PKovjog, PKoO_ccn, nohto*io 


311274 AW293128 Hs.197101 ESTs 


3.78 


MOt UOVIC HOC/* nnn DOIA/E O 

NCkw45, PRoVJDOn* RPWc-2 


326668 


CH^OJts gi^552455 


3.78 


HOI UO»C MOI UCO DDCO Inn 

Nut-n34o, NCknoy, PKbojog 


304195 N35382 


EST singleton (not in UniGene) wffli exon 


3.77 


MOI UCO DCAAIE O DDCO nnn 

NGHiby, RPVtc-A KKoUjOOn 


336294 


CH?^FGBCS.786J 


3.77 


DDCO nnn DDCO Inn MOI UCO 

PRoCjoon, PRovjog, Nomoa 


311613 AL046311 Hs.252443 ESTs; Weakly simSar to III! ALU SUBFAMI 


3.76 


imyi BfTAtA I1D ktTSA 9Q4 

HI 28, DT474, R«J-MUA-Z31 


338123 


CH22JBAAC005500.GENSCAN.19&5 


3.70 


KAD ITtA 004 UT9Q OTATA 


318230 AA558125 


EST duster (not in UniGene) 


3.74 




303985 AW514501 Hs.156110 Immunogfobufri kappa variable 1&£ 


3.73 


Rjra imA <V04 DTil7>l DDCO nnn 

lve-MUA-231, BT4/4, PKoUjCQn 


336502 


012^GENEo\833J& 


3.72 


MOI IMAK. DDIAfC 9 DDCO nnn 

NI/K1343, RPWt-A rnoOjDOn 


334083 


CH22JH3EHES327J7 


4 74 
3.71 


MOLURO MOL149AC DHCP mn 

IxuhrUKf, NUKtwo, rnov/^cuii 


333600 


CH22JGBmS2\ZJt 


9 7 

3.7 


MOIUCO m/OA D Dp9 
NUrnOV, UV WW, rW 


339424 


CH22J>J579N18.GENSCAN.14^3 


3.09 


MOLUCO MOLI49AC DROP rnn 
NLrHloy, NUHU40, rflOOJDDn 


336862 


CH22J=GENES\297-2 


O C7 

3.0/ 


MOLU94H DROP mn PROP fm 
NUrrkWO, r^ou_.oon, rr»ow_joy 


334823 


CH22J=GB^ES.437J5 


1 C7 

3.0/ 


DDUJC 9 DDCO Lvi PROP rnn 

(Wit-it, rrvoujog, rr»oo_uuii 


329940 


CH.16_p2gfj6165199 


3.00 


OAI 1 Ffi CD &JOC7 
WUJUO, CD, MXjTI 


300275 AI632123 


HS-231521 ESTs 


3.00 


DDCO mn WOLWRQ RPWP-9 


328820 


Ca07_nsgi5868330 


9 PA 
3.00 


Mpiucq MPLW945. PRSC con 

nUrrnN, l\UrfwrrO, rnwjMl 


332398 AA446446 Hs.104788 H sapiens done 24554 unknown mRNA 


3.66 


PRSC con. PRSCJog, NCI-H345 


325791 


CH.14J*glj6682476 


165 


NCWtM5.BT474,LnCap 


300672 R14469 


HSw256573 ESTs 


3.65 


MCF7, MWi©A453. MBM1A-435S 


338344 


CH22^tAC00S0OGBJSCAN312-8 


3.65 


NOH345, PRSCJog, PRSC^con 


333257 


CH22JGEHES.118J5 


3.65 


DU145,EB,OVCA^ 


332140 AA620724 Hs, 11 2890 ESTs 


3.65 


MB^DM53,EXJ145,MCF7 


337489 


CH22JGENES.799-2 


3.63 


NC44J345, NCI-K69, PRSCJog 


305167 AA663080 


EST singteton (not tn UniGene) with exon 


3.63 


OVCA-R, M&WDA-231. M&MWDM35S 


3362)0 


CH22J : GENES.719_4 


3.61 


NC4-H69, PRSCJog, NC1-H345 
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339208 CH2?JF113D11.GBeCWiW 159 
320090 AB0Q2058 HsJ 13275 pjrfne^recqrtm'P2Xto1;oiptenr 158 

335999 CH22J50*ES£57J 157 

332909 CH22JFGBES36.13 157 

306531 AA991423 ESTdngfeDn(nothltoiGene)wffliexon 156 

333261 CH2*jFGB<ES.119J 1K> 
303883 AA17639S Hs.169624 ESTs 154 

335831 CH2*JGB<ES32Dj5 153 

333983 CH2*J=G0*ES31OJ 152 

333623 CH2£J=GB*E&222J 151 

333997 CH22_FGB^ES310_22 15 

325623 CH.14Jsg¥867000 15 

309151 AQ358S Hs.140 lmmurog!oJ^ganiiH3((^roaifcBi) 1.49 

305080 AA641485 EST singleton (not in UnJGene) with exon 149 

339268 CH2ajA354l12.GENSCAN.106 147 

310048 AH98352 Hs.105077 ESTs 147 
314758 AA521458 Hs.1 92738 ESTs 146 

334664 CH2£J=GENES.418 15 145 

334661 CH22_FGENES.418j3 145 

330984 H38678 Hs32766 H sapiens dona 24803 mRNAseq 144 

333464 CH22_R3BCS.160J 144 

333580 CH2?_R»CS.199^ 142 

313356 AJ266254 Hs.1 32929 ESTs 142 

334518 CH22J=GB*ES.400.1 141 

333627 CH22J=GB«S225J 14 

309641 AW194230 Hs2531Q0 EST 14 

338221 CH22_&tAC005500.ffl^AIl246.10 3.4 

312993 AJ392673 Hs.1 25230 ESTs 14 

318336 AJ97180S Hs.1 64158 ESTs 138 

326218 Cai7_hsgiJ5867226 138 

336231 CH22J=GBES736J3 138 

307912 AJ382224 EST singteton (not &)UniGene)wfli exon 337 

336161 CH22jrGBES.707_6 137 

300875 AW134756 Hs.192477 ESTs ~ 337 

336593 CH22JGB4ES.135.1 137 

310696 AW31620 Hs.1 60875 ESTs 336 

304745 AA577771 EST singteton (not In UrcGsne) wffli exon 136 

308911 AI86Q287 Hs.1 56110 Immunoglobulin kappa vartabte 103 336 

336347 CH22J=GaCS.815Ji 336 

334906 CHZLFGB£S.452J1 333 

334548 CH2*_FGaE&403_13 333 

336695 CH22JGBIES.48-4 332 

316684 AA807187 Hs220783 ESTs; Weakly sfanBarto WNT-1 PROTOONCO 

315901 AI521558 Hs.1 79718 wnyb avian myetobbstosis viral oncogen 33 

320115 T93574 EST duster (not in UnlGene) 33 

307847 AJ363993 Hs.1 57273 EST 33 

327899 CR06_hsgfI5868156 328 

304612 AA514207 EST singleton (not bUni6ene)wffli exon 128 

330021 CH.16_p2g5S671889 127 

338132 CH2^_BAAC005500.GBJSCAR20(>2 127 

323690 AA317497 Hs.188897 ESTs 127 

327362 CK01Jwg|6552412 326 

333488 CK2*JSB*E&167_3 126 

334106 CH2*JGB<E&330_5 126 

306990 AI129298 Hs.146491 EST; WeaHysinflar to FERRITIN HEAVY CH 126 

328420 CR07JB gi|5868411 126 

336214 CH22J=GB4ES.722J 326 

330565 U51095 Ha 1545 caudal type noroeo box barecription fed 325 

333879 CH2*J=GB*ES291.15 325 

300145 AI240850 Hs232016 ESTs 325 

327581 CR03_hsgjp887825 3^5 

308153 AI500429 Hs.1 103 transforming growfo tutor, beta 1 324 

308337 AI608847 EST singleton (not ki UmGene) with exon 324 

329406 CKXJ*g!I6682547 323 

325482 CR12JisgiF866957 323 

337544 CH2^J^ES3337 322 

337204 CH22JG9CS595.1 322 

309451 AW1Q512B Hs246687 EST 322 

337259 CH22JGBE&6493 32 

336489 CH2£JGB<ES331J0 32 

334804 CH22_FGB£&435_4 na 

335739 CH2?J=GBES£01 10 3.13 

306264 AA935305 Hs.179779 ribosomai protein 117 117 

32938S CRXJtsgff6004484 117 

323479 AA278246 ESTduster(nothUniG8fle) 116 

304731 AA576085 EST singfeton (not In UmGene) wfth exon 116 



PRSCjcon, NCW69; PRSC Jog 
OVCAHUiCap,NOK69 
NOH345, NOKS9, PRSC.con 
NOH345, PRSC.con, PRSCJog 
BT474, MWADM51 MWUDA435S 
HT29,CALU5.fc®M3A.231 
NCUmNCW345, RPWE-2 
MCF7,Br474,OVCA* 
NCU045, PRSC.con, PRSCJog 
NCRffi9. PRSC.con. PRSCJog 
NOH345, PRSC.oon, PRSCJog 
CALUS,HT29,BT474 
EB, MCF7, MB-MDA453 
NOH23,NCm460,NO358 
NO-HKJ, NO-H345, PRSC_ccn 
Caco2,PRSC_coaNW«9 
NCW23,NCHC3,Nd4i52) 
NCW69. PRSCJog, PRSCjcon 
NO-H69, PRSCLcon, PRSCJog 
OVCA*.MCF7,PC3 
NCW69. MB-MDA-231, MCF7 
PRSCjcon, NCUC9. PRSCJog 
RPVYE^. PRSCjcon, NCI-H345 
PRSCJog, PRSC.con. RPWE-2 
HT29,BT474.BT474 
HT29.M&MDA451MCF7 
NOH69, PRSCJog, NCW345 
PRSCJog, NOH345. NCI-H345 
OVCAREB,CALU6 
NCW460 f NCHH69.NCW345 
NCKK69, NCI-H345, PRSCJog 
NCW345. PRSC.con, RPWE-2 
NCI-H69, NCWC45, RPWE-2 
RPWE-2, PRSCJog, PRSC.con 
PRSCjcon, NCWi69, RPWE-2 
HT29,OVCA*,BT474 
NCW345, RPWE-2, PRSCjcon 
EB, DU145, CALU6 
NCH*9, PRSCJog, PRSCjcon 
Caco2,CALU6,MB-Pv©A453 
NCW345, PRSC.con, NCI-H69 
NCJ-H69, PRSCJog, PRSC con 
131 DU145.S.M&MDA-231 
Caco2,LnCap,NCW<69 
DU145,HT29,CAUJ6 
NOH345, PRSCLcon, PRSCJog 
BT474.M&MDA231.A549 
DU145, CALUS.LnCap 
A549,HT29,ffi 
MB-fvOA-231,CALUS,ffi 
RPWE-2, NCf-H345\ MCF7 
NO-H69, RPWE-2, PRSCJog 
NOH69. NCLK345, PRSCJog 
NCR«9, PRSC.con, PRSC Jog 
NCW045, PRSCJog, PRSC.con 
NCI-H69, NO-H345, PRSCJog 
MCF7, ffi, OVCA-R 
EB.DU145.HT29 
PRSCjcon, PRSCJog, NCW69 
NCW345, PRSC.con, PRSCJog 
EB,DU145,MB4yiDA453 
MCF7, S, EB 

PRSCJog, PRSC.con, NCW345 
DU145,HT29.M&WDA^31 
NOH69, NCW345, PRSC.con 
NCI-H69, NCW^ PRSC.con 
NCI-H59, PRSC.con, PRSC Jog 
PRSCjcon, RPWE-2, NCI-H345 
PRSC.con, NWC45,NCm69 
CAUJ5,MWm435s l Caco2 
PRSCJog, PRSCjcon, RPWE-2 
NCI-H69, RPWE-2, PRSC.con 
LnCap, NCt-HK, B 
RPWE-2, NCW345, PRSCJog 
PRSC.con, NCRC45, RPWE-2 
NCR^9,LnCap,DU145 
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339419 CHZUXB79N16.6BJSCAN.11-11 3.15 

301202 AI536797 Hs.173155 ESTs 3.15 

333608 CH22J=GENES.216J ai5 

339193 CH2ZJF113D11.GENSCAN.1-5 3.14 

310527 AW293404 HsJ>1198S ESTs 3.14 

321146 AA707443 Hs.183983 ESTs 3.14 

333271 CH22_FGBfc&121J2 3.13 

330280 CH.05 _p2 gi|8671910 3.13 

309977 AW451663 EST singleton (not In UniGene) with exon 3.13 

307588 A1285535 EST singleton (not In UniGene) with exon 3.13 

330551 U39840 Hs.105440 hepatocyte nuctear factor 3; alpha 113 

314404 AW104203 Hs.157505 ESTs 3.13 

334030 CH22_FGaiE&32D_2 3.13 

309108 AI925949 EST singleton (not In UniGene) with exon 3.13 

317516 AI733250 Hs.192262 ESTs 3.12 

304161 H71886 EST singleton (not fnUniGene) with exon 3.12 

334590 CHZLFGENES407.13 3.12 

333408 CH22JGENES.145 6 3.11 

330387 H14624 H&31386 ESTs; HJghJy simSar to secreted apoptos 3.11 

332567 N23730 H&25647 v-fos FBJ murine osteosarcoma viral onco 3.11 

333682 CH22JGB4ES347J0 3.1 

323152 AI680562 *te246192 ESTs; Weakly slmflar to REGULATOR OF MTT 

311142 AI638441 Hs.195649 ESTs 3.1 

333441 CH22J=GB«ai51_5 3.1 

326459 CH.19_hsgi}5887400 3,09 

313493 AA910339 Hs.126868 ESTs 3.09 

339356 (mjA354l12.GENSCAN.31-1 3.08 

333629 CH22J=GBE&226_5 3.08 

304127 H42981 EST stogteton (not In UniGene) wfth exon 3.07 

325691 CH.14JisgiJ5867021 3.07 

333014 CH22J=GBCS.61_6 a07 

327379 CR02Jisgtp867795 3.07 

337816 CH2^B*AC005500.GENSCAN.13-1 3.06 

337954 CH22^tAC00KC0.GBJSCAN.9W 3X5 

328109 CR06Jegij5868020 3J05 

338527 CH22^AAC00550aGENSCAhU96-15 3.05 

320083 T87761 EST cluster (not In UniGene) 3.05 

333485 CH22J=GBCS.161_2 3.05 

334788 CH22JGBES.43*J3 3.04 

302681 X97550 EST 3.04 

336238 CH22J=GBJES.743Ji 3.03 

337606 OB^C20H12GB*SGAN.17-2 3.02 

333545 CH22JGBCS.180 1 3.02 

309782 AW275156 Hs.156110 lmniunop>^kapp^vaffebte1C)^ a02 

324277 AA429440 HsJ07285 ESTs 3.02 

321074 H38098 Hs32756 ESTs 3.02 

337094 CH22J=GB4ES.465-19 3.01 

313913 AW391342 EST duster (not In UniGene) 3 

329140 CRX_hsgI|6017060 3 

335331 CH22J=GBiE&535_4 3 

334827 CH22_FGENES.437_9 2J99 

326029 CH.17Jisgi|5867176 * 299 

303100 T09353 EST 299 

328768 CR07_hsgi)6O17O31 2.99 

329392 . CHXJisgi)6478815 298 

305168 AA663105 EST singleton (not in UniGene) wflh exon 298 

300992 AA601213 Hs.191798 ESTs 298 

334474 CH22/G0ES.394_5 298 

322647 AA007534 H3. 125062 ESTs 298 

310620 A1341328 H&178953 ESTs Z97 

328276 CHX7Jisg{J6004471 297 

331018 N26904 H&24048 ESTs; WeaHy sintfar to FK506/rapamydn- 296 

321523 H78472 Hs.191325 ESTs; WeaWy similar to cDNA EST yk414c9 2.96 

339280 CH22_BA354I12GENSCAM14-12 256 

305967 AA886428 EST singleton (not to UniGene) with exon 2.96 

335755 CH2*JGB€S.604jl 295 

323907 AL043098 Hs.165387 ESTs 2.95 

330370 CHXp2gq6580495 295 

334529 CH22_FGENES.40?J 294 

339258 CH22JA354»2GB*Satfi7-11 294 

334783 CH22J=GENES.432J 294 

335266 CH22_FGB€S.521J2 294 

323707 AA845957 Hs.128385 ESTs 2.94 

336199 CH22J=GBCS719_3 293 

338OT CH22-B*AC005500.GENSCAN30a5 293 

333652 CH22J r G84ES^39_1 253 



NCW169. PRSCJog, RPWE-2 

LnCap, NCI-H69, Caco2 

NCW345, PRSC jan, PRSCJog 

NCI-H69, NCI-H345, PRSCjxm 

PRSCJog, PRSC.con, RPWE-2 

PRSCjoon, NO-H69, PRSCJog 

NC1-H345, NCKH69, RPWE-2 

NCI-H69, NCW345, PRSCJog 

PRSCjsn, PRSCJog, RPWE-2 

R»WDAr231,BT474,BT474 

MB-MDA453, LnCap, Caco2 

DU145,S,OVCA-R 

NCI-H69, NCW345, PRSC_con 

BT474, MCF7, EB 

OVCA^EB t M&WDA453 

PRSC_con,NCW«9.RPW&2 

UCU^ NCR&45, PRSC_con 

PRSC_tog t RPWE-2, PRSC_con 

DU145.OVCAtf.PC3 

EB, MB-MDA-453, MCF7 

PRSCjoon. PRSCJog. RPWE-2 

3.1 PC3,MB^©M53,DU145 

PRSCjoon, RPWE-2, PRSCJog 

RPWE-2 NO-W345, PRSCJog 

EB,CALU6,PC3 

NCI-H345, PRSCjoon, RPWE-2 

NCW69. NO-H345, PRSCJog 

NCI-H69, NCt-H345\ PRSCJog 

LnCap. PRSCjoon, DU145 

NCW345, PRSC joon, NCW69 

PRSC.con. PRSCJog, NOH345 

PRSC.con, PRSCJog, NCI-H69 

NC W69, PRSC.con, PRSCJog 

PRSCJog, NC W69, NCW345 

KT29.OT474.ft4B4«)^231 

N(m9, NO-H345, PRSC.con 

BT474, MB-MDA435S, MCF7 

NOH345, RPWE-2 PRSCJog 

EB,A549,CAUJ6 

O VC A-R, EB, MB-NffiA-453 

NOH69, PRSCJog. PRSCjxm 

HT29.BT474.M&MDA-231 

NCW69, NCI-H345, RPWE-2 

PRSCJog, PRSCjoon. RPWE-2 

BT474,MB-MDA-231,HT29 

PC3, BT474, K&MDA-231 

PRSCjxm, PRSCJog, RPWE-2 

NCW345, RPWE-2 PRSCJog 

EB.DU145.PC3 

MB-MDA-4353, KT29, BT474 

CALU6 r EB,DU145 

NCW345, RPWE-2 PRSC_con 

MB-MDA-453, NQ-H345, RPWE-2 

NCI-H345, PRSC_con, NCI-H69 

NCW69, NCI-H345, PRSCjxm 

LnCap, NOH345,MCF7 

Caco2,HT29,NCk358 

NCUK9, PRSCjoon, RPWE-2 

HT29, OVCA-K.A549 

PRSCjxm, RPWE-2, PRSCJog 

NCI-H345, NCI-H69, RPWE-2 

Caco2NCW460,A549 

PRSCjxm, PRSCJog, NOH345 

NCW69, PRSCJog, NCUI345 

NCJ-H520, NCI-358, MB-MDA-453 

EB,A549.MB^DA453 

PRSCjoon, NCIK345, PRSCJog 

EB,DU145.MB^DA435s 

EB.MCF7, DU145 

NCW69, NCI^345, PRSC.con 

A549,Caco2PC3 

NCI-K69, PRSC.con, PRSCjoon 

NCI+1345. PRSCjxm, PRSCJog 

NCH469, NCI-H345, PRSCJog 

NCWK69,NCVH345 r NCi^58 

PC3,OVCA«,BT474 
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338479 CH22JGBJES329J39 232 

338086 CH22JGBES388J5 292 

338516 omjammm.GBisctm32s 292 

320121 T33557 EST duster (not (nUnfiene) 292 

305782 AA844730 ESTsa^etaitiotliUriGenBjwSiexxxi 232 

333304 CttSLBA354l12GBiSCAfi2D-16 231 

327472 CR02jBojp867775 291 

311458 AW139426 Hsk244718 ESTs 231 
338431 CH2Z3tAC00550aGENSCAN3514 29 

339230 CH22_BA354m.GENSCAN.1-6 239 

320586 NWL00365 EST duster (net hUniGene) 239 

304777 AA561692 H&2186 eutenyefc transfeSon ekmgafion facto 239 
337768 »C23^C0Cfl0S7.GBfSCAN.1196 238 

319465 AA319115 Hs.191558 ESTs 238 
319068 W93011 Hs.110155 ESTs 237 
330958 H08815 Hs.159824 EST 237 
334215 CHZLFGBJES357.7 237 

333568 CH22J=G£NES.185 1 237 

333142 CH2ZJ=GB<ES35_5 237 

330239 CR05_p2giJ6671857 237 

302120 R55140 Ha31075 ESTs; WeaHy sknQarto Weak stmflarily 237 
338679 aC2_3tACOO55OO.G0rfSCAR47O-1 236 

329041 CHXJb g35888564 236 

333541 CH22J=GENEai78J3 236 

337011 CH22JGBES.427-* 236 

324031 AA375646 EST duster (not to UniGote) 236 

331842 AA416586 H&98232 ESTs 236 
336599 CH22_fGBIES350_3 235 

337586 CH22J20H1ZGENSGAN.&4 235 

336177 Cm2J=GENESJ12^2 235 

337522 CH22LFGENES319-1 235 

338595 CH2^EMtAC005500.GENSCAR437-2 235 

309522 AW15C044 Hs 252259 ribosomal protein S3 235 

336981 OC2JSBIES397-7 235 

330286 Ca05j32glJ6671913 234 

333713 CHZLfGBES251_2 234 

335068 CH22SGENESA83J5 233 

309175 AA641288 Hs.181165 etikaiyofctrare^elongaficm factor 283 

326380 CK19Jis gi|5867327 282 

334970 CH2£J=GENES.466_3 232 

337097 CH22JGENES.47M 232 

323676 AI702835 EST duster (not In UniGene) 232 

333785 CH22J=GENES^74 4 232 

334175 CH22JGENES349 10 231 

337865 CH22^lfcAC00550O.GBSCAN.4&5 231 

302585 AA083564 H&249220 H sapiens raRNAforhTD^comptetBCds 231 

336623 CH22J=GENESA5 231 

332854 CH22J=GBCS^1 23 

336978 CH2UGBES384-10 28 

326874 Ca20jsgi|6682507 23 

315121 AA565011 Hs.105902 ESTs 23 

311185 AI638294 Ks224665 ESTs 23 

334682 CH22J=GBES.419_4 23 

316845 AW418715 Hs250388 ESTs 279 

331599 N74626 Hs30535 ESTs 279 

315681 AW022054 Hs.136591 ESTs 278 

313012 AI2D7390 Hs.143929 ESTs 278 

313476 AA010267 EST duster (not In UniGene) 278 

327277 CH.01Jisg?5867473 278 

310981 AM94514 Hs.171380 ESTs 278 

335090 CH22JGENES.490J 277 

328581 Ca07jBgpX)6033 277 

333219 CH2£j=GENES.1Q4Jl 277 

308311 AI581855 EST singleton (not to UniGene) wffli exon 177 

329760 Cai4_p2gj[6048280 277 

303925 AW469999 H&25B523 ESTs 277 

337628 CH2^.C20H12GB<SCAN^W1 277 

333520 CH2*J=GB*E&174_3 277 

303168 AAS72479 Hs. 197770 ESTs; WeaWy skubr to estrogen-respons 276 

313451 AW138189 Hs.122672 ESTs 276 

328474 Ca07Jeg(5ffi8446 276 

331988 AA477414 HsS242 puiine^ichefementbinolng protein B 276 

308180 AA922503 EST smgteton(not in UniGene) wffli exon 276 

321071 AA013011 Hs7415Q2 Cdc42 effector protein 4 276 

302972 W73400 EST 276 

305185 AA663985 H&248038 major Wstocoapafib% complex; ciass 275 



NOKS9, PRSC.con, PRSCJog 

PRSCjxm,Caco2.CAUJ6 

NOH69, NCW345, PRSCjcon 

EB, BT474, KT29 

MB^DA453.MCF7,DU145 

PRSCjcon, PRSC_fcg, NO-H69 

PRSCJog, PRSCjcon, RPWE-2 

PRSCUm PRSCJog, RPWE-2 

BT474, MCF7, MB-MDA453 

NCPi69,NOH345, PRSCJog 

OVCA^HT29.MB4UDA-231 

OVCA*.EB,MCF7 

NCW69. LnCap, DU145 

NOH460.NC^H520,NCW58 

BT474. MB-MDA-453, MB-MDA-435s 

OVCA-R. PC3.A549 

NCWtiS, PRSCjcon, PRSCJog 

PRSCjcon, PRSCJog, NCJ-H69 

NCI-H69, HT29, HT29 

Affi-MDA453k MB-MQM53, EB 

CALU6, MB-MDA-435S, BT474 

NOH345. PRSCJog, PRSCjan 

LnCap, PRSCjcon, RPWE-2 

NCW«9, NOH345. PRSCjcon 

NOH69, PRSCJog, PRSCjan 

NCI-K345, PRSCJog, LnCap 

DU145.0VCA4*,HT29 

LnCap, NCW69,NCW345 

NCI-H345, NOH69, PRSC.con 

NCI-H69, PRSCJog, RPWE-2 

CAUJ6, OVCA-R, HT29 

NOH69, PRSC.con. NCW345 

MBM)A453, MB-MDA435S, MB-MDA435s 

NCI-H69, PRSCjcon, PRSCJog 

NQ-H345, PRSCJog, NCI-HB3 

RPWE-2 PRSCLcon, NCI-U69 

MB-MDA-231. NCW345, RPWE-2 

EB,UCap»DU145 

NO-H69, PRSCjcon, PRSCJog 

PRSCjcon, NO-H69. FPWB2 

NCI-H345. NC W69, PRSCJog 

LnCap, A549, CALU6 

OVCA-R, Caco2, fv»-MDA-453 

RPWE-2, BT474, MCF7 

CacoZ NO-K23, BT474 

EB,DU145,MBM5A-453 

NCJ-H345, PRSC.con, NO-HB9 

RPWE-2, PRSCJog, PRSC.con 

PRSCjcon, NO-H345, RPWE-2 

RPWE-2, NOH345, PRSCJog 

NCW345, PRSCJog, RPWE-2 

NCW69, NOH345, PRSCJog 

NCW69, PRSCJog, RPWE-2 

RPWE-2, NO-H345, PRSCJog 

A549. M&MDA453. MBABM35s 

NCI-H4Q), MB-MDA-453, MCF7 

DW45.llffi4©M53.MCF7 

NCI-H520, NCI-H460, KT29 

DU145.CALU6.S 

LnCap, RPWE-2, NOH460 

NCW69, PRSCJog, PRSC.con 

HT29.MB-MDA453,MCF7 

NCI-H69, PRSCJog, NCLH345 

N&MDA-231,HT29,CALU6 

CALU6, DU145, EB 

NCWiB9, LnCap, M&-MDA-231 - 

NC^H69, LnCap, mWM53 

NOH69, NCW345, PRSC.con 

DU145, OVCA-R. MB-MDA-453 

OVCA-R, EB, DU145 

NC1-HB9, NO-H345, RPWE-2 

MB-MDA-435S, A549, OVCAR 

NOH69.DU145, LnCap 

PRSCJxj, PRSCcon, NCW345 

NCW45, RPWE-2. NO«Q 

DU145,A549,BT474 
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335998 


OC^GENES.656 J 6 


0 "7C 


319138 R11699 


Ks.73818 iibk;i])nokytochnm) c reductase hinge p 


2.75 


336387 


CH22J r GBJES.822J 


2.75 


338054 


CH2^_0AACOO55OO.GENSCAN, 158-2 


2.75 


316041 AA7191B3 


ESTduster(nothUnK3ena) 


2.74 


336863 


CH22J=GB4E&2974 


2.74 


335975 


CH22J=GENES.652J5 


Z74 


302952 AF103179 


EST 


2.74 


326122 


CH.17_hsgiI5867194 


2.74 


337427 


CH22JGENE&7614 


2.74 


308063 AM69244 


Hs.119252 turner protein; translatfcnaltj^coiitroJle 


2.74 


325433 


CH.12JJ3 glp866936 


2.74 


316252 AI572633 


Hs.190405 ESTs 


2.74 


310837 AI418688 


Hs.170301 ESTs 


2.74 


313562 AW467335 H&257676 ESTs 


2.74 


335455 


CHZLFGENES.562J5 


2.74 


304792 AA583101 


H&29797 noosomal protesi L10 


2.73 


331979 AA469937 Hs.105322 EST 


273 


336198 


CH22J : GB^ES.719J2 


273 


314698 AI660452 


Hs.187127 ESTs 


2.73 


307954 AW19692 


EST singleton (rwt in UniGene) wBh exon 


173 


318288 AI088590 


Hs.134702 ESTs 


2.73 


327833 


CH.05J)Sgfi5867968 


273 


300221 AW44S8Q2 H&217953 ESTs; Highly simter to NK-TUMOR RECOGNI 


326039 


CH17Jisgil5867179 


2.73 


318457 A1149678 


Hs.143952 ESTs 


2.72 


336753 


CH22.FGENES.1284 


272 


330086 


CH.19_p2gqS015293 


272 


333566 


CH2*J=GENES.183J 


2.72 


339384 


CH22_BA232E17.GBISCAN^22 


2.71 


338668 


CH2^BAAC005500.GENSCAN.465-1 


2.71 


300798 AB82618 


Hs.194613 ESTs 


2.71 


303745 AI142379 


EST 


2.71 


305197 AA666301 


EST singleton {noth UniGene) with exon 


2.71 


338725 


CH22_EIAACC0550aGENSCAN.499-1 


2.7 


307799 AI351112 


EST singleton (not hlWGeneJwBh exon 


2.7 


309598 AW173642 H&250106 EST 


2.69 


302727 L10141 


EST 


Z69 


308544 AI695133 


EST singleton (not in UniGene) wifli exon 


2.69 


322877 AA079727 


EST duster (not in UniGene) 


2.69 


325695 


CH.14.hsgip552446 


2.69 


307728 A1335557 


EST singleton (not h UniGene) wffli exon 


£68 


302399 N79624 


EST 


2.68 


309343 AW028652 


EST singleton (not h UniGene) with exon 


Z68 


339360 


CH22JA354i12.GBJSCAN.32-2 


2.68 


337821 


CH2^tAC005500.G^SCAN.13-11 


2.68 


337338 


CH22JGENES.717-7 


2.68 


334510 


CH22_FGENES398_8 


2.68 


300918 AA491286 Hs.128792 ESTs 


2.68 


335536 


CH22J=GENES.574_2 


2.67 


335311 


CH2£J=GENES£32J 


2.67 


338959 


CH22JXB2I10.GENSCAN^31 


2.67 


339081 


CH22.DA59H18.GB4SCAN.37-10 


2.67 


334068 


CH22JFGBC&327_23 


2JST 


338976 


CH2?_PA59H18.GB4SCAN.1-3 


2.66 


325524 


CH.12J»g$866981 


2£8 


333069 


OC2J : GBES.76_5 


2j66 


336203 


CH22JGENES.719_7 


2.6S 


333133 


CH22J=GBJES£3_9 


2.6S 


304074 T77842 


Hs.142528 ESTs 


2.65 


330919 AA224594 Hs.86941 ESTs 


255 


333248. 


CH23JGBJES.115J5 


2.65 


336665 


CH22J=GENES.42-5 


2.65 


315322 AA770599 


EST duster (not in UniGene) 


2.65 


307474 AI264023 


EST singleton (not In UniGene) with exon 


2.65 


323221 ALD50O2O 


Hs.127384 DKFZP564C1 96 prater. 


2*5 


301767 AW361892 


EST 


2*5 


327246 


CROIJjs 9^5867547 


2J65 


337403 


CH2^FGENEa752-2 


2.65 


328221 


Cri06J»gip868099 


2.64 


336759 


CH22JGB4ES.133^ 


2.64 


327532 


CH.0SLhsgi|6469818 


2.64 


305621 AA789095 


EST singleton (not in UniGene) with exon 


2.64 


322931 AA099329 Hs.151764 ESTs 


2.64 


327278 


CH.01jBgip867473 


2.64 


332235 N51413 


Hs.109284 ESTs 


2.64 



NCW69, PRSC.con, RPWE-2 

NCW345, NO-H69, PRSCjoon 

PRSCjm RPWE-2, PRSCJog 

OVCA*,ffi,DU145 

DU145>MCF7,ll&MQA453 

MB-MDA453, MCF7, 0 VCA-R 

CAUU6, EB, A549 

CALU6, MB-MDA435S, BT474 

HT29,Caco2,PC3 

RPWE-2, NO-H69. PRSCJog 

NCW58,NCW23,Caco2 

NCI-H345, PRSC.con, RPWE-2 

OVCA*,MCF7.A549 

NC W345. PRSC.con, RPWE-2 

HT29.MCF7,MB-h©A-231 

NCI-H69, LnCap, PRSCjoon 

EB,OVCA*.MBM>A453 

MCF7,BT474.NaH460 

NCI-H69, PRSC.con, PRSCJog 

MB-MDA-231, LnCap, BT474 

KT29, KT29.EB 

PRSCJog, NCWi345, PRSC.con 

BT474,PC3>M&MDA-231 

2.73 NOH520, NCl-358, M&WDA453 

MB^OA453,EB,EB 

PRSC.con, PRSCJog. NOH345 

MB-MDA435s,NC}-H520,MCR 

KT29,MB4\ADA453,MCF7 

ftT29.OT474.aVCA* 

NOU69, NCW345, PRSCJog 

NC1-H69, RPWE-2, PRSC.con 

PRSC.con, NC1-H345, PRSCJog 

PRSCJog. PRSC.con, RPWE-2 

EB.NCJ-H520.OVCA-R 

CALU6, MB-MDA453, PC3 

HT29,BT474 f MCF7 

NCW58.NOK69.NCI-H23 

OVCA-R.BT474.PC3 

KT29, CALU6, MB-MD*435s 

NCW345. NCI-H69, PRSC.con 

NCkH69, NCI-H460, NOH460 

NCI-H69, PRSCJog, NCI-358 

NCI-H69, PRSC.con, NCI-H345 

H729. MB-MDA531, M&M3A-231 

NCI-H69, PRSCJog, PRSC.con 

PRSC.con, PRSCJog, PRSCJog 

NO-H69, PRSC.con, PRSCJog 

NOH460,NaH23,NCM58 

NB-MDA-435S, CALU6, DU145 

NC1-H69, NOH345, PRSCJog 

M&MDA435S.C8C02.A549 

NC1-H345. PRSC.con. NCI-HB9 

NCUG45, RPWE-2, NCW69 

PRSC.con. RPWE-2, PRSCJog 

PRSC.con, PRSCJog, RPWE-2 

NCI-H345. RPWE-2, PRSC.con 

NC W69, NOH345. PRSC.con 

OVCArR.PC3.A549 

HT29.OVCA-R.A549 

DU145,CALU6,EB 

PRSC.con, RPWE-2, LnCap 

NCI-H345, PRSC.con, MB-MDA-231 

NC1-H69, PRSCJog, PRSCjoon 

A549, M*M>A-453, MB-MDA-435S 

NCI-K69, NO-H345, RPWE-2 

M^MDM53,MCF7 I HT29 

NCW345, PRSC.con, PRSCJog 

EB,OVCA*,DU145 

PRSC.con, PRSCJog, RPWE-3 

MCR,MB-MDA331,BT474 

NO-H69, PRSCJog. PRSC.con 

PC3.CALU6.A549 

HT29, M&MDA231, R&WDA453 

PRSC.con, RPWE-2. NCJ-K345 

EB,NCI4i460.NCm69 

DU145.ffi.OVCA* 
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332792 CH2*J=GENE&3_2 2& 

312340 AW Hs.176333 ESTs 223 

337484 CH22J5B<ES.79W ?63 

325783 CR14Jisgq64567B0 223 

303672 AW5Q2380 K&210527 ESTs Z63 

306009 AA894560 EST sfcg)eicn(irth UniGene)*^ exon Z63 

308543 AI695484 EST sinp^{nrthUnfcene)*a exon 223 

33793) CH22_a^C005^(^SC^L81-3 222 

327791 CK05Jisgi|5867g77 222 

33093 AA232678 Hs27073 ESTs 2.62 

32729 CR01jBojp867454 162 

302150 AF061756 Hs. 152531 heart and neural crest oerivafives expre Z61 
304881 AA598501 Hs. 1951 88 glycefaktehyd&^phosphate dehydrogenase 221 

335956 CH22_FGB<ES247_3 221 

326508 Oi19Ji3gip867435 221 

335863 CH22J=GBCa629J 221 

334752 CH22_FGENES.428_1 221 

3332® CH2JLFGBES.128 19 221 

306709 AHJ24215 Hs.131477 EST 221 

305816 AA854776 EST singleton (not tn UniGene) with exon 22 

327264 CH.01JB ^5867461 22 

310205 AW075527 Hs.252259 rfboscmaJ protein S3 22 

324492 AA479507 Hs.135179 ESTs 22 

322649 AA525549 EST duster (not In UniGene) 26 

329384 CRXJ«p#868869 26 

321240 M62378 EST duster (not fn UniGene) 26 

302751 AA299576 Hs.156110 ImmiffiogtobuBn kappa variabte 103 229 

3)5841 AA860348 EST singleton (not In UniGene) with exon 229 

32410) AA402242 Hs. 122799 ESTs 228 

334196 CHZLFGBJE&353 4 228 

338451 CH22_B^C005500.GB4SCAN.359-39 228 

300333 AW237396 Hs^27052 ESTs 228 

3)5046 AAS32201 EST shgteton (not In UniGene) wffli exon 228 

3)5648 AA807652 Hs.156110 tmmunogtobuBn kappa variable 1M 227 

3)1744 W22230 EST 227 

329182 CRX_h3 $6056331 227 

318178 AW137425 Hs.158401 ESTs 227 

330057 CR17_p2gi)B478962 227 

326552 CU19_hsgI|58673Q8 227 

311956 T57085 Hs. 188464 ESTs 227 

327185 CH.01_hsgi]61 17805 227 

302183 NNL00224 EST 227 

327263 CR01_hsgi]6525274 226 

339164 CH22JIA59H18.GENSCAN2&4 226 

332763 AAC63554 H&90959 ESTs 226 

330579 U67733 Hs. 154437 phosphafiesieiase 2A; cGMP-sOmuiated 225 

329948 CR16_p2gij554Q101 225 

300282 AW044305 Hs236131 ESTs; Highly simiar to hcmeoAxnafn-fnte 225 

335448 CH22JFGENES263J5 225 

330959 H09174 Hs.26484 HIRAnnteracfing protein 3 225 

307262 AI202100 ESTsingteton (not in UniGene) wfth exon 225 

335806 CH22LFGBES.616 8 225 

335782 CH22LFGBES.609 4 225 

301703 AW301478 EST " 225 

329018 CHXjT3gi)6249620 254 

329870 CR14_p2gip706435 224 

334504 CH2^J=GBES398J2 224 

304707 AA564846 EST singleton (not in UniGene) with exon 223 

329326 CHXJB935868806 223 

334418 CH22_FGBE&384_5 223 

338124 CH22LEMAC005500.ffi4SCAN.19fr2 223 

318423 A1362671 H&214491 ESTs 223 

333106 CH22_R3B€S20^3 223 

333668 CH22LFGBES4I5J2 223 

333567 CH22J=GBE&184.2 223 

309592 AW172384 EST singleton (not in UniGene) wffli exon 222 

328989 CHi»Jisgp68535 222 

326725 -CHJDJi3giI6552456 222 

3029% AF054663 EST 222 

335733 CH2Z_FGBE&601 3 222 

336000 CH22_FGBIES258~1 222 

327774 CH.Q5_hsgIj5857964 222 

328557 Ca07_hsgip863489 222 

328228 Ca06jjsgp868105 222 

32B305 CH.07Jisgi|6004478 222 

334010 CH22JGBES213J 221 



HT29.Caco2.A549 

NeW5e.Ntt358.HT29 

NttH69, NttH345, PRSC.con 

EB, OVCA-R PC3 

PRSCJjg.NCWC45.NttH© 

HT29,MB^D/V4Z31,CALU5 

PC3.A549.NCW58 

PC3.0VCA*,MCF7 

PRSCJog, PRSC.con, NCU&45 

OVCA-R, MCF7, LnCap 

NttH345, PRSCjXMi. RPWE-2 

OVCArR.PC3.A549 

MMADA435S, NttW23, MCF7 

DU145.PRSCjBon.PC3 

RPWE-2, NCI-H460, NCW58 

PC3.HT29,Ntt358 

PRSCjcon, NCI-H69, PRSCJpg 

HT29,Ntt358,Caco2 

MB-MDA-435S, MCF7, BT474 

MB-MDA453, MCF7, MB-ftfl)A-435s 

MB-MDA435S, MB-MEJA-435S, M&A4DA453 

OVCA-R, EB.DU145 

DU145, EB, OVCA-R 

PRSC.con, RPWE-2, PRSCJog 

NO-H69, NCW345, PRSC.con 

BT474.CALU6.M&M)Ar231 

MCF7, M&MDA453.0VCA* 

NCW345. PRSCJog, PRSCjcon 

EB.PC3.HT29 

NC1-H345, NttHS9, PRSCjcon 
MB-MDA435S, NttH23, MCF7 
PRSC.con. PRSCJog, NttH69 
NttH460. M&WDA453, MB-MBA-435s 
PRSCjcon, RPWE-2. NttH345 
PRSCjcon, PRSCJog, NCI-H345 
PRSCjcon. RPWE-2, NC W345 
MB-MDA-231, PRSC.con, BT474 
NCI-H345, RPWE-2, PRSC.con 
NttH345, PRSC.con, RPWE-2 
HT29, MB-MDA-453, NCI-H460 
CALU6,HT29,EB 
MCF7, PC3. OVCA-R 
PRSC.con. NttH69. PRSCJog 
NCI-H69, PRSC.con, NttH345 
RPWE-2. NttH345, PRSC.con 
HT29.CAIU8.PC3 
NCW460, MCF7, MBM3A-453 
NttH460, NttH23. NttH23 
MWVtoAr453.BT474.MCR 
MB-MD A453, HT29, MCF7 
MCF7, DU145, MBM)A-435s 
NCI-H345, NCJ-H69, PRSC.con 
Caco2, MB-M2A453. MB-MDA435S 
PC3.MCF7.MB4BA453 
NttH69. PRSCJog, PRSC.con 
NttH23,NttH460,Ntt358 
HT29.BT474,M&MDfc231 
NOH520, EB, NOH460 
MB-MDA-231, NttH345, NCJ-HB9 
NttH23,Ntt358,NCW460 
NCI-H69, PRSC.con, PRSCJog 
OVCA-R, EB, DU145 
NCI-H69, PRSC.con, PRSCJog 
NttH69, PRSCJog, PRSC.con 
NttH69. NttH345. PRSC.con 
LnCap, mMSS, DU145 
MB-MDA-435S, OVCAR EB 
PRSC.con, NttH345, NttH69 
HT29.BT474.CALU6 
NttH69, PRSCJog, NttH345 
UCap,OVCArR.DU145 
DU145.CALU6.HT29 
MB-MDAp453, RSWDA4353, JCF7 
NttH69, NO-KJ45, PRSC.con 
NttH69, NO-H460, PRSCJog 
NttH69, PRSCJog, PRSC.con 
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339033 CH22J3A59H18.GENSCAN.26-1 151 

335340 CH22^FGB^ES.535J7 Z51 

300156 AB455B2 Hs^33395 ESTs 231 

305880 AA886065 Hs.156110 Immunoglobulin kappa variable 1 D-8 15 

310841 A1968009 Ha232Q24 ESTs 25 

336908 CH2^J=GBES^2 15 

304674 AA541735 EST singleton (not In UniGene) with exon 15 

314521 AW503939 Hs.107149 ESTs; Weakly stoiBarto PTBASSOCIATED S15 

307592 AB85739 EST singleton (not to UniGene) with exon 15 

331476 N26190 Hs.43768 ESTs 15 

325803 CR14J»gJf5552451 15 

306549 AA9937S6 EST singleton (not In UniGene) wflh exon 2.49 

304833 AA586504 EST stogteton (not In UniGene) wiih exon 149 

336333 CH22J=GENES.813.i 2.49 

332320 T71134 Hs.100551 EST 149 

328236 CHJ»_hsgIt5868117 149 

317335 AI656979 Hs.130210 ESTs 149 

339188 CH22_PA59H18.GB4SCAN72-16 148 

334235 CH2ZJGENES361J9 148 

301214 AW450950 Ks. 157034 ESTs; Wealdy similar to Unknown [H^apte 148 

332843 CH22JGENES.19J 148 

337431 CH2*_FGBC$.763-7 148 

336757 CH2^JGBCSu131-1 148 

305403 AA723748 EST singleton (not In UniGene) wfth exon 148 

330065 CH.19_p2oJ6165044 148 

309245 AJ972447 EST singleton (not to UniGene) wffli exon 148 

328876 Ot07Jisgq6525286 147 

333944 CH22^FGENES^2 147 

328504 CH.07Jlsgi]5868471 147 

338120 CH22_EKtAC005500.GENSCAN.195-1 147 

306710 A1Q24221 EST singleton (not to UniGene) wito exon 147 

305064 AA636012 EST singteton (not to UniGene) wift exon 147 

329995 CR16_p2gi}4567166 147 

315694 AI821743. Hs.168418 ESTs; Moderately similar to IH! ALU SUB 146 

331004 H64622 Hs32748 ESTs 146 

305259 AA679225 EST stogteton (not to UniGene) with exon 146 

304576 AA496563 EST singleton (not to UniGene) with exon 146 

318887 R60487 Hs.21065 ESTs 146 

308954 AI868958 EST stogteton (not to UniGene) wiih exon 146 

301140 AI807692 H&207128 ESTs 146 

322085 AA088500 Hs.170298 ESTs 146 

339130 CH22.DA59H1&GENSCAN.5W 146 

337612 CH22.C20H11GENSCAN.22h5 146 

313765 AW206181 Hs.185981 ESTs; Weakly similar to gag [Rsapfens] 145 

311665 AW294254 K&223742 ESTs 145 

328620 CH.07_hsgi|5868241 145 

305361 AA7089Q2 EST singleton (not to UniGene) wito exon 145 

336243 CW2./GENESJ46.1 144 

320299 H08323 Hs.177181 ESTs 144 

302535 H48676 EST 144 

333465 CH22LFGENES.160J2 144 

334109 CH2£JGENE&330_8 144 

301749 F12998 Hs.90790 ESTs 144 

324575 AW502257 EST duster (not to UniGene) 144 

337114 CH2*_FGENES.494-17 144 

336087 CH22LFGENES.688J6 144 

315678 AB57119 Ks.120036 ESTs 144 

333258 CH2?J=GENES.118J6 144 

303798 VD0505 Hs.36977 hemoglobin; delta 144 

309759 AW268822 EST stogteton (not to UniGene) w8h exon 144 

318946 AI122843 EST duster (not to UniGene) 144 

321986 AL133656 EST duster (not to UniGene) 144 

338151 CH2£^iACC05500.GENS€AN.207-5 144 

327056 CH21Jsgi|6531965 144 

309605 AW182800 EST stogteton (not to UniGene) wflh exon 143 

335783 CHZLFGENES.610J3 243 

325790 CK14_hsgi]6381957 2.43 

339342 CH2*_BA354I12.GENSCAN.27-10 143 

335777 CH2^FGe€S.607_13 143 

309972 AW450350 Ha257283 ESTs 143 

308716 AI788009 EST singleton (not to UniGene) with exon 143 

338087 CH22_0AACOO55OO.GENSCAN.174-16 143 

306930 A1124518 EST stogteton (not in UniGene) wifli exon 143 

319032 AW409728 Ks.80449 ESTs; Weakly similar to cytoplasmic dyne 143 

30433) AA157834 EST stogteton (not to UniGene) with exon 143 

320638 R54766 Hs.101120 ESTs 2.43 



NCW69,NCi«45,PRSCj»n 
NO-H69, PRSC.con, PRSCJog 
PRSC.con, PRSCJog, NCW345 
EB. OVCA-R, DU145 
Lf£ap,NCW58,CALU8 
NOH345, RPWE-2, PRSCJog 
RPWB-2.NCRi69.MCF7 
NCW460,EB,Caco2 
PRSCjcon, NOH345. PRSCJog 
NCI-H345. NCI-H69, PRSCcon 
NC W345, RPWE-2. PRSC.con 
A549, OVCA-R, CALU6 
MCF7, DU145, LnCap 
NCW345, PRSCjoon, PRSCJog 
NCI-H345, LnCap, RPWE-2 
PRSCjoon, NCW345, PRSCJog 
MCF7,M*MDA-453,PC3 
NOH69, PRSCjcon, PRSCJog 
NCWH520, MB-MDA-453, A549 
HT29.A549.A549 
DU145, CALU6 f EB 
PRSCjcon, RPWE-2, NCi-H69 
NCttt69, PRSCJog, PRSC.con 
NO+C3.DU145, OVCA-R 
PRSCcon, PRSCJog, NCW69 
MB4S3A-231, NCI-H69, HT29 
MB-ffflAr231.CALU6,PC3 
NOH69. RPWE-2. PRSCJog 
UlCap, M&WOM53, MB4iffiM35s 
MB-MDA-231, NO-H69, PRSC.con 
OVCA-R, EB, LnCap 
NCW69, RPW&-2, PRSC_con 
OVCA-R, DU145, M&MDA453 
EB.A549, LnCap 
B,MCF7,MfrMQM35s 
PRSCjcon, NOH345, RPWE-2 
PRSC.con, RPWE-2, PRSCJog 
NCI-H345 f Caco2.Caco2 
PRSCcon. PRSCJog, RPWE-2 
OVCA-R, MB-MDA-231, HT29 
PRSCJog. PRSC.con, NCI-H345 
NO-H345, PRSC.con, RPWE-2 
EB,A549,Caco2 
RPWE-2, PRSCJog, PRSCjcon 
PRSCJog, RPWE-2, PRSCjcon 
MB-MDA-453, MCF7, MB-M3A-435S 
KT29,MB-MDA4359,A549 
OVCA-R, MB-MDA453, \©-MDA-435s 
PRSC.con, RPWE-2, NCI-H345 
' MB-MDA453, EB, DU145 
NO-H69. PRSCjcon, PRSCJog 
NOH69, NCW345. PRSCJog 
NO-H345, RPWE-2, PRSCJog 
NCW345. PRSC.con. RPWE-2 
NOH69, PRSCJog, PRSC.con 
PRSC.con, CacoZ, PRSCJog 
NCW5&\PC3,NCW23 
MB-MDA431 , HT29. CAUU6 
MB-MDA-435S, MCF7, MB-MDA-453 
MB-MDA453, EB, MCF7 
PC3.0VCMIDU145 
DU145,CALU6,CALU6 
PRSC.con, PRSCJog. RPWE-2 
PRSC.con, NCW345, RPWE-2 
NCW58,NCW23,NCW520 
PRSC.con, PRSCJog, NCW045 
M8-MDA-435S, MB-MDA-453, M&MDA-453 
BT474, M&MDA-231, MB-MD*453 
DU145.S.BT474 
MCR.MB-MDM53, OVCA-R 
NC W345, PRSC.con, PRSCJog 
DU145, PC3, CALU6 
NCm69,MCF7,BT474 
RPWE-2, A549, NOTO 
MB-MDA-453, PC3, OVCA-R 
MCF7, M&M3M35S. I»MD&453 
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335281 

317431 AT575790 Hs.132453 
306511 AAS88891 



333420 
338113 



329164 
336316 

310831 AJ927594 

327334 

334017 

308138 A1434446 
333074 

306546 AA393109 
336516 

308791 AI042387 
32S411 

308659 AI75Q091 

3135D4 AI190405 

326073 

334047 

325464 

334764 

312737 AKB35G0 
306591 A&0Q248 
333582 
337843 
335284 

305134 AA653159 

335527 

336795 

303144 AF2Q2889 
334948 



Hs.132895 



Hs.146246 



322929 AI365585 
333561 
338239 

3236TO AL040411 
305903 AA873085 
312573 AW297673 Hs.190526 
334470 
333272 

304010 AW518383 Hs.177592 
337316 

316769 AI914939 Hs212184 



T98872 Hs.194181 
AK71992 Hs.143631 



331223 
337172 
300625 
337092 
334528 
338411 
331344 
334044 
333918 
317168 
333424 
317779 
315142 
310471 
325049 
305234 
337760 
311502 
337548 
326981 
309600 



AA357S27 Hs.70208 



AI042614 Hs.125910 



AW450515 

A1380577 

AW270515 

AW410339 

AA670431 



AW204380 Hs^08662 



AW182066 



327937 



303607 AL046388 H&208206 
304227 N94974 Hs.75344 
314101 AW452279 H&257542 
325026 AI671168 Hs.12285 



CH22_FGBE&524_4 

EST3 

EST sinojaton (not hlinK^)wa axon 
CH22J^£Sw133_4 
CK07Jisgip8^417 
CH2ajGBES.146J1 
CH2^BitAC00550aGB4SCAN.188-13 

CHJUo pJ5868691 
CH22J=GB*ESJS9J1 
Hs.161142 ESTs 

CH01Jisgip9Q2477 

CH2^J=GBES315^ 

ESTsbigtetm{notlnUniGene)waiexDn 

EST singleton (not In UnSGene) wfflt son 
CH22J=GB£&a36J 
EST s!np£ton(ndhUniGene)w& exon 
CtiX_hsgq6682549 
EST smgteton (not th IMGene) wffli exon 
Hs.143127 ESTs 

CH17Jisg?6882495 
CH22JGENES328_5 
CH12Ji3gij5B66947 
CH22_FGe^a428_13 
ESTs 

EST singleton (not hi UnlGene) wfft exon 
CH22_FGB(ES^01_2 

CH22_BtAC005500.GENSCAN^fl 
CH7? J RPMFRfPg fi 

EST skigtetan (not fcTuniGene) wffii exon 
CHZlTGaiESJSnj 
CH22J=GBflES.17&-5 
EST 

CH2*J=GENES.465J5 
CHiHJts gq638192B * 
ESTs 

CH22J=GENES.180J8 
CH2^JEMACffl)5500.GENSCAN26« 
Hs.161763 ESTs; WeaMy similar to WAA0738 protein 
EST singtsten (iwt h UraGeno) wim exon 
ESTs 

CH22J=GENE&394J 
OgLFGB^ES.122_1 
ribosomal protest; laige; Pi 
CH22_FGENES£92-1 
ESTs 

CH22J=GENES.7B3_4 
ESTs 

CH2^J=GB4ES^2 
ESTs; WeaWy stmfer to WASP-femBy prot 
CH2^J=GENEa465-12 
CH22J=GENES.402_8 
CH22_BtAC005500.GENSCAN341-7 
ESTs 

CH22J=GENES323J2 
CH22J=GENES.296_7 
ESTs 

CH22J : GB0.147j4 
ESTs 
ESTs 
ESTs 

ESTs; WeaWy similar to centaurin beta2 
EST singleton (notfn UniGene) wBh exon 
CH22_BAAC000097.GENSCAN.11W 
ESTs 

CH2SJGBESJ44-5 
CH21Jts 9^6588016 
EST singleton (not in UnK3ene)wffli exon 
CHXS_hsgii586B500 
CHXSJb 91(5868192 
Oi07Jsgp68353 
ESTs; WeaHy sWar to N afl alpha note 
rfboscmaJpro^S4;X-Bnked 
ESTs 
ESTs 



Hs.128381 
Hs.190219 
Hs.149596 
Hs^56310 
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2-37 


BT474,EB.OVCA^ 


2-37 


DU145,CALU6,CALU6 


2-37 


LnCap, PRSCJog, NCI-H345 
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315015 AJ659989 Hs.132625 ESTs 2.37 

328662 CR07JisgI]6G04473 237 

305867 AA864572 EST singleton (not In UniGene) wfth exon 237 

333296 CH22_FGENE&13*J3 237 

331070 R01116 Hs.182059 ESTs 236 

333698 CH22J=GENES^50J2 236 

316423 AA758756 Hs. 121 380 ESTs 236 

323189 AL121194 Hs. 12039 ESTs 236 

318889 Z43296 Hs. 18720 pragiannmed eel deaft 8 (apoptDsb^nduc 236 

334237 CH22_FGB*ES36*_1 236 

315931 A1700148 Ha 11 7328 ESTs 236 

326884 CR20Jsgi|66B2511 236 

333132 CH2i/GENES33J 236 

306574 AA995719 Hs.76067 neat shock 27kD protein 1 236 

324416 A1669524 Ha.194115 ESTs 236 

329496 CH.10_p2gI|3983518 235 

320994 H22381 EST duster (not in UniGene) 235 

320481 AA461139 HsJM372 ESTs; Weakly simBar to AI207H1 .1 JRsap 235 

309958 AW444488 EST singteton (not in UniGene) wfth exon 235 

327009 CR21J»gip867664 235 

309594 AW172821 Hs.181165 ortajyofcrjanslat™ 235 

335468 CH2*J=GBIES£67_4 235 

304269 AA0S9029 EST singleton (not In UniGene) with exon 235 

305877 AA865649 EST singteton (not in UniGene) with exon 235 

305700 AA815428 EST singteton (not In UniGene) wift exon 235 

326423 CH.19_hsgi|5867369 234 

334560 CH22_FGENE&404_3 234 

337100 CH22JH3ENES.472-3 234 

301505 AWD14374 Hs.144849 ESTs 234 

312142 AW298359 Hs221069 ESTs 234 

305787 AA845035 EST singteton (not h UniGene) with exon 234 

338686 OC2^tAC005500.GBlSCAN.472^5 233 

331977 AA465207 Hs.125887 ESTs 233 

314687 M79114 Hs.135177 ESTs 233 

336089 CH22JGBJE&688.18 233 

338952 CH22JXJ32110.GENSCAN.23-22 233 

334612 CHZLFGENES.41 1 J 1 233 

338223 CH2^JNtAC005500.G9ISCANmiO 233 

327845 CHJJ5_hsgi)6531962 232 

308187 AI538108 Hs.1 56110 Imrrainogbbufln kappa variable 1 0-8 232 

317767 AW294164 Hs.128340 ESTs; Weakly similar to Cdc42 GTPase-act 232 

330468 L10343 Ks.112341 protease inWbtor 3; sWrvderfved (SKAL 232 

319003 R17712 EST cluster (not fa UniGene) 232 

323022 AJ 066733 Hs.133865 ESTs 232 
303148 R73167 Hs.1 2731 7 ESTs; WeaWy amfar to CYTOCHROME P450 

303215 AW250314 EST 232 

318891 H10477 Hs.196208 ESTs; WeaWysfrnBarto flil ALU SUBFAMI 232 

336653 CH22J=GBIES334 232 
333329 CH22_FG£NES.138^22 232 
301980 U69962 Hs.121498 potassium voftage^ated channel Shatwe 231 
336968 CH22JGENES375-28 231 
308539 AI694191 EST singleton (not in UniGene) w3h exon 231 
326417 CK19J»gip867362 231 
328851 CK07Jsgii6381923 231 
329254 CHJChsgi|5868733 231 
303075 W88779 Hs.59125 ESTs 23 
335131 CH2SJH3ENE&497 15 23 
303129 AA308334 Hs.172210 MUF1 protein 23 
327067 Ca21Jsgq6531965 23 
324064 AW137650 EST duster (not in UniGene) 23 
325965 CH.16JB gp867147 23 
334525 CH22J r GENES.4Q2J 23 

336654 CK22J=GBIES34-2 23 
302348 AF1O0779 Hs.1 94680 WNT1 inducfote signafing pathway protein 23 
309275 AI989570 EST singteton (not In UniGene) with exon 23 
329246 CHXJ» 845668732 23 
305567 AA774834 EST singleton (not in UniGene) with exon 23 
322907 AA084941 EST cluster (not in UniGene) 23 
318633 AT7Q3241 H&202653 ESTs; WeaHy sirriar to Xfn [MLmuscutus] 229 
309233 AB71416 EST singleton (not h UniGene) with exon 229 
308913 A1860692 Hs.1 19122 rfoosomal protein L1 3a TB 
335827 CH2^FGENES.620J 229 
334066 CH2^FGEICS327J1 229 
302656 AW293005 H&220905 ESTs 229 
308974 A1872290 Hs.140 (mmunoglobtilln gamma 3 (Gm marker) Z& 
333607 OESJGENE&216J 229 



PCT/US02/06001 



MB-MDA-453, llteMQAr231, LnCap 

NaH345,RPWE-2,PRSC_con 

MCF7, MB-MDA-453, M&WDA-231 

EB.PC3, CALU6 

OVCA-R, fi/©*fl>M53 ( A549 

HT29, OVCA-R, Caco2 

HT29,MCF7,MBJffiA435s 

PC3.NCW460.DU145 

OVCA-R, A549, M&MDA-453 

NCW345, NO-H69, LnCap 

MCF7,NCWi345,DU145 

A549.EB.PC3 

NCWH69, HT29, EB 

RPWE-2, PRSCJog. PRSC.ccn 

NCW345, RPWE-2. PRSCjoon 

HT29.MCF7.&BWDA-231 

NCm23.A549.CAm6 

PRSCjoon, RPWE-2, PRSCJog 

NC1-H345, PRSC.con, PRSCJog 

HT29, BT474, MCF7 

HT29.DU145.B 

NCW69, PRSCjxn. NCI-H345 

PRSC.con, PRSG Jog, RPWE-2 

A549.MCF7, OVCA-R 

PRSC.con, NCW345. PRSCJog 

PC3.MCF7, LnCap 

HT29, NC W460, K©-MDA435s 

PRSCJog, PRSC„ccn, RPWE-2 

CALU6,MMOA-231,DU145 

PRSCjoon. RPWE-2, PRSCJog 

NCW23, NCW520, NOH460 

BT474, MB-MDA-231, MB-MDA453 

OVCArR,A549, MB-MDA-435s 

NQ-H69, PRSC.con, NCU4345 

PRSCjoon, Caco2, PRSCJog 

PC3.OVaA-R.HT29 

OVCA-R, MB-MDA-453, EB 

DU145.MWlffiA453.MCF7 

OVCA-R, NB-MDA453, PC3 

NCI-H69, NCi-358, PRSC.con 

BT474.CALU6.M&MDA-231 

PC3, Cacc2, HT29 

MCF7,PC3,A/&MDA453 

CALU6, MB-MDA-231, DU145 

232 NOH345, PRSC_ccn. RPWE-2 

NO-H345. PRSCjxn, PRSCJog 

NCW69,LnCap.NCi-H345 

DU145,EB,LnCap 

DU145.BT474.M&MDA-231 

NO-H345, MB4iffiA231, LnCap 

HT29,Brr474.EB 

NCLH345, NOH69. PRSCJog 

HT29,MCF7,BT474 

NCW520. NCWM60, NOH23 

RPWE-2. NCI-H345, PRSC.con 

DU145,OVCA*.a 

NCI-H69, NO-H345. PRSCJog 

LnCap, DU145.HT29 . 

NCW345. NCW69, MB-MDA435s 

DU145. HT29, EB 

NCI-H69, NCI-H345, RPWE-2 

NCW345. PRSCjoon. NCW69 

BT474, PC3, MB-MDA-453 

UiCap,CALU6.DU145 

NC W460, NO-H23. NC1-H520 

NCI-H69, NCI-H345, PRSCJog 

CALU6.CAUi6.MCF7 

MB-M)A-231,CALU6,EB 

NCI-H345, PRSCjxn, RPWE-2 

CALU6,OVCA*.EB 

MWffiA435s.MCF7.HT29 

PRSCjxn, PRSCJog, RPWE-2 

PRSC.con, PRSCJog. NOH345 

NCW23, CacoZ CALU6 

CALU5.A549. NCW69 

OVCA-R, MCF7.A549 
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335t74 


CH22JGBSES5XJ - 


95Q 


HT2Q A549.MB-MDA453 


332228 AA489680 Hs.134406 ESTs; WeaHy stnferto Eftnlp haiMtog M 


ooq 




00041 f 


CH2^J : G9<ESv823_39 




MnUffi9.NCJ-H345.PRSC too 


oZMZo AA251401 


EST ouster (not In UniGene} 


229 


HT2L NBAfflA-231 BT474 


000010 


CH2^FGBES^-1 


229 


KOKffiff NCW46D NCHtiS 


310017 AJ188739 


Ks.148488 ESTs 


CCS 


NCU&A5lPR£C hn.PRSC mn 
liwTTW*tw» rTwwjuy, rnuyjMi 


334055 


CH?2J : GENES^27_6 






337168 


CH22J=GnNE&562-28 


ooft 

CLO 


NO-HG9 PRSC Ion NCLH345 


329824 


CK14_p2g?ffi30758 


O 9ft 


upum rAlirs RpiMC_o 

NUrfi£J, wnUJO, v\TYwCrC 


333831 


CK23JGE3<iES292L13 


97ft 




339127 


CH22J)ASH18.GB^CAN^5-1 


228 


poop rrY) wntKHS RPWF-? 


305686 AA812726 


EST singteton (no< In UnKSera) with exon 


228 


Mrqucon MfMJ499 M<~i_LMCrt 
nLfHTZJ, NUHWOU 


329782 


CH.14_p2g?5912597 


228 


nLrt-nos, nufuw, rTiowjuy 


311059 AI81D001 


Hs.175346 ESTs 


228 




336934 


CH22J=GBCa35M 


228 




314633 AA761093 


EST ouster (not In IhtfGene) 


228 


nunA p UT90 niHAt; 
u vwrf% n 1 uui v} 


331596 K72574 


HsJ5022) ESTs 


228 


ACiQ MTP7 WTUWft 


330729 AA258559 Hs-3736 ESTs; WeaHy sknfer to DELTA-UKE PROTE 228 


imjuf>A_oqi CAIIK MCF7 


338285 


CH2^BAAC0(S500.GB4SCAN^3 


227 


UHLMRO PDQP Inn DOCP rym 


300154 AT245127 


Hs.179331 ESTs 


227 


u/Him MrMucon MPL'Vn 
NUHTtO, NUriKwU^ NOrOOO 


306383 AA969078 Hs,1836S8 ifccsonal protein 129 


227 


KrVlC-Z, nuHUU, rf\Ov_)Og 


309005 A1884454 


EST singteton (not hi UniGs)e) with exon 


227 


AM 9, WJCI , PIVI 


332985 




227 


KrWC-^C, MOrrtWO, rT\Ov_J0g 


337426 


CH2£/GBCS7oi-3 


977 


m MAS PR ! TR 
IAJ140, CO, WUaJO 


337778 


CH223*AC000097.GB^SCAN.11^) 


227 


NUTID9, rnotrjOon, PKoyJOg 


329705 


CH.14JJ2 $6065790 


227 


DDCT* *m DOC/* Un DOUUCO 


335971 


CH2ZJGQ1ESJB52J 


227 


DOQ/* Inn MOJUinA Mf*LW97 


315862 AKJ75846 


Hs.133996 ESTs 


227 


HI Zii, MbMVlJA-WoS, UVUA-K 


316466 AO11204 


Hs.12638 ESTs 


227 


Nw+Ww, NUK»0» BT4/4 


334430 


CH2^JGBC&385_3 


227 


NUHWW, NUHttw, PKSujOOn 


331941 AA452257 H&99272 ESTs 


226 


BOOr iuui I COO/* Inn 

PiwCjcon, Lnuap, pksc jog 


301230 AW269804 Hs.153019 ESTs 


226 


NuHwo, rrtoOJOg, N(^HVmU 


317394 AI935024 


Hs.190518 ESTs 


226 


II/-H UOJIC DOC/* hui CD Of* Inn 

NUHTJ40, rKoujDon, r KoUJOg 


306220 AA92B363 


EST singleton (not in UniGene) wffii exon 


226 


M/H_UOj|C DDCT mm DDC/* Inrv 

NUHU40, rKoojDOn, rKooJOg 


304134 H54627 


ESTstngietan (not In UnJGene) wfih exon 


226 


NHJC f*A| I (C D/*0 

UU14d| UALUo, rvo 


335421 


CH2?_FGENES551_1 


226 


MfM-UCQ DDQT* mn DOQT Ion 


305260 AA679280 Hs.156110 tmmunogtobuftn kappa vaMIeltVS 


226 


NURvWo, NUHnuM, rKou_C0n 


303592 AA421129 


EST 


226 


Ail 1 »c n\JPA D ni MAC 


317982 AI004985 


Hs.130607 ESTs 


226 


PUj, MD-MUA-400S, AMU 


325304 


CH.11Jisgp66910 


226 


MCr7, CALUb, A549 


334118 


C^_FGBES330_19 


226 


OnCf mn M/*l UCO DDCP Um 

PRSCjOOn, NOHioy, PR5K/J09 


335687 


CH22J=GB4ESi596_j2 


226 


AM!), UALUO, uioap 


334035 


CH22JGENESJ32Z3 


226 


NCf-rtJ45, rKoO_COn, KrWc-Z 


305454 AA738413 


EST singleton (not In UniGene) with exon 


225 


ro Lmo /*aiiic 
co, H i £3, UALUo 


335902 


CH22J=GENES.635J0 


225 


CD niMAS UT9Q 
to, Uu 140, n\£s 


339215 


CH2?JF113D11.<KNSCAN.6-10 


225 


DDC/* Ann DDCP bwi DDUfC 9 


325310 


CHG7JisgIf5868327 


225 




337396 


CH2^FGBCa749-1 


225 


CD, LJU 140 


336808 


CH22J=GENES205-3 


225 


MfLWiS WfH-HRfl ewer mn 
k w-nono, rtot-noo, nwv/^ujn 


305808 AA853958 


EST singtelon (not In UniGene) wBh exon 


224 


HP IDA ytCQ Hi HAS PR 

nUHVftJ/V-400, UUl'K), CD 


333571 


CH23J=GENES.188J2 


224 


MTP7 MP MfVA Attn PT^ 


323023 AA225188 H&258539 ESTs 


224 


CD, UU l*KJ, unUJO 


334626 


CHZLPGBIES.41612 


224 


NUH1D9, rtUl-nO'Kj, rt\OL»_Rjg 


333593 


CH23JH3B<ES210_2 


224 


Mrt-HBO K1CLH345. PRCf* rnn 
nOriTDsp fiV/rrw'W, rr\ovy_LLWi 


326708 


CH2DJB $5867583 


224 


KirHJjAsn Mciuri MPLuson 

NbHWw, nUHVtai 


314502 AI041717 


Hs.1 32141 ESTs 


223 


tdntJJfiAK DDVUR-9 DDC/* mn 
NU-H340, KrVVC-fc, rT\oO_C0fi 


309181 A1951727 


EST singleton (not in UraGene) wffii exon 


223 


rrWv_Wl I, i W, IViDnVUJsnncO i 


324926 H56196 


Hs.1 17798 ESTs 


223 


en co HI HAS 
CD, CD, UU IHD 


333632 


CH22JGEJ€S227J3 


223 




328243 


Ca06JsgP56292 


223 


D/^5 1 nf*an 1 


327037 


CH21j6gJI6531965 


223 


InPon niHAS PR 


307380 AI222985 


EST singleton (not In UiriGene)w3h exon 


223 


WUrTw*r3, rrvwv_wih rT\*JVJUy 


334766 


CH2ZJGENES.428J5 


223 


drop fa MCLH345 RPWF-9 


335236 


CH2?_FGENES515_8 


223 


OVCA-R. MCF7 BT474 


336615 


CH22JGBCS.613_5 


223 


MPU-fRtl PRon fan PRCfi mn 


307558 A1281998 


EST singteton (not in UniGene) wffi exon 


223 


ru H45_ OVCA-R. CALU6 


308029 AJ457115 


Hs.62954 fenffin; heavy pofypep&del 


223 


CD, Ift&Tf»#fV*IW* 


331508 N47559 


Hs.46732 EST 


223 


MB-M5A-453, MCF7, BT474 


320980 AJ237672 


Hs214142 5;10fliefliyteft8len^iyaYofDkte reductase 


223 


OVCA-R, EB.EB 


304241 AA010976 


EST singleton (not in UniGene) wffii exon 


223 


BT474, M&WDA435S, M&MDA-231 


314682 AI190864 


Hs.178226 ESTs; WeaWy slmQar to Wl ALU SUBFAM1 


223 


MB^QA-231, MCF7, OVCA-R 


308332 AJ624301 


EST singleton (not In UniGene) wfth exon 


222 


OVCA-R, BT474, CALU6 


314476 AW207857 Ha 169604 ESTs 


222 


DU145, EB, A549 


327864 


CR06Jtsg$868130 


222 


Nd^69. PRSCJog, PRSCjan 


337279 


CH22JGcNES.66&-2 


222 


NCW345.NCKH69.PRSC_.con 


302263 AA325517 


EST 


222 


OT474,NO+e20 i Dm45 
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322840 AAD83710 


EST duster (not In UniGene) 


ZZZ 


307574 A1283549 


ESTsingJBtan(RotInUniGerie)withexon 


777 


319027 AA716612 


EST duster (not bi UnIGm) 


777 


305925 AA877883 


EST singleton (not In UniGene) wfth exon 


777 


329725 


CK14_p2gfp065785 


777 


316194 AW298529 Hs255774 ESTs 


222 


301119 AF142579 


EST 


777 


333815 


CH22JK3BIE&282J 


777 


334358 


CM2JGEN£Smj 


222 


303763 AFO43250 


Hs30928 DNA segment on chromosome 19 (unique) 11 221 


335593 


CH22J=GENE&581_32 


221 


334026 


CH22J=GENES318_3 


221 


322224 AF086054 


EST duster (not in UniGene) 


221 


309836 AW295497 Hs. 157397 ESTs 


221 


332669 M33374 


Hs.661 NADH derfftfogenase (ubiquinone) 1 beta s 221 


307629 AI30Q246 


EST singleton (not fn UniGene) with exon 


221 


300470 T87841 


EST 


221 


330064 


Cai9jj2giI6165044 


221 


338819 


CH2ajDJ246D7.GENSCAN.1-24 


221 


337797 


CH2^3AAC005500.GENSCAN^4 


221 


328025 


CK06J8 g^902482 


22 


326240 


CR17jtsgi|5867260 


22 


312865 AW0G5376 Hs.173280 ESTs 


22 


338450 


CH22_EKtAC005500.GENSCAN35W5 


22 


302532 U60181 


Hs248115 growth hormone secretogogue receptor 


22 


321132 AA081495 


EST duster (not In UniGene) 


22 


337787 


CH2^EMACQ00Q97.GENSCAN.12S3 


22 


337032 


CH22J=GENES.43&3 


22 


300026 M11507 


AFFX control: bansfenin receptor 


22 


333139 


CH22_FGENESJ3J6 


22 


33438 


CH22J=GBESl37^4 


22 


335002 


CH22_FGENE&470_7 


22 


335000 


CH22JGENES.470J5 


22 


337298 


CH22JGEhES.67M 


22 


302461 AF104253 


tte241381 cotactor required for Sp1 transcriptiona 


22 


334819 


CH22JGBIES.436J5 


219 


300428 AW452660 Hs253296 ESTs 


219 


302569 ACC04472 


muffipia UniGene matches 


219 


339401 


CH22.BA232E17.GENSCAN.7-7 


219 


328791 


CR07Jisgip868309 


219 


337333 


CH22_FGBES.71M 


219 


339363 


CH2^_BA354I12.GENSCAM336 


219 


329429 


CH.YJtsgif5868882 


219 


336927 


OGLFGENE&34W 


219 


336351 


CH23JG9CS*16_3 


219 


313466 AA004731 


Ks.148876 ESTs 


219 


307433 AI244895 


EST singtetan (not in UniGene) wfth exon 


219 


336590 

310758 AI770001 


CmfGETCSiiU 


219 


Hs209445 ESTs 


218 


327823 


CH.05Jisgp867968 


218 


313257 N92638 


EST duster (not In UniGene) 


218 


335377 


CH22J=GENE&543J7 


218 


303958 AL042931 


EST singtetan (not in UniGene) with exon 


218 


320153 AF064594 Hs.120360 phosphollpase A2; gpoup VI 


218 


335201 


C822JGENE&508J0 


218 


338591 


CH22_EMAC005500.GENSCAN.4344 


218 


331958 AA455960 Hs.99405 ESTs 


218 


337218 


CH22J=GBiE&614-2 


218 


309470 AW118833 


EST singleton (not fn UniGene) wflh exon 


218 


331896 AA435495 Hs.97174 H sapiens mRNA; cDNA DKFZp566E164 (from 


330275 


CH.G5_p2gi)6671904 


218 


335817 


CH22_FGENES.618_5 


218 


332896 


OE2JGENES35J0 


218 


303294 AA205300 


EST 


217 


338703 


CH2?3^C005a)0.GENSCAN.48O-2 


217 


300115 AC15044 


Hs208130 ESTs 


217 


330979 H2246S 


Hs^1795 ESTs 


217 


317246 AW105092 Ks.155690 ESTs 


217 


33078 


CaXJsg?5868597 


217 


312554 A1222630 


Hs.109390 ESTs 


217 


3232)7 AI052795 


Hs.192201 ESTs 


217 


301894 AA484435 Hs.41997 apoprotein 


217 


329D97 


CaXJsgJi5868624 


216 


328328 


CR07J1S 985868375 


216 


302671 AA522440 Hs.135917 ESTs 


216 


329201 


CHXnsgp68718 


216 



KT29,MB^DA453,CAUJ6 

OVCA*,GALUb\BT474 

LnCap,NCW69,NttH69 

NCJ-H345, NOH69, NO-HS9 

NCWffi9. PRSCjan, NOH345 

CALU6 r EB,NCI-H520 

A549, OVCA-R, ffi 

MB-MDA-4353, EB, MB-MDA453 

NCW345, RPWE-2, PRSC.con 

Caco2,NaH23,NCW520 

NOH345, PRSCJog, RPWE-2 

NCI-H69, PRSC.con, NCWG45 

PRSC_con, PRSCJog, RPWE-2 

NCW345, PRSC.oon, RPWE-2 

Nd-H520,CALU6,OVCA-R 

MB-MDA-231, MB-MDA453, HT29 

PC3, EB, CALU6 

N04«9 I PRSC_con i BT474 

NCI-H69, RPWE-2, PRSCJog 

LnCap,NCt^69,NCm520 

RPWE-2, PRSC_con, PRSCJog 

EB,LnCap, MB-MDA-453 

DU145, DU145, OVCA-R 

MCF7, MB-MDA453, NteMDA435s 

PRSCjwn, PRSCJog, PRSCJog 

NCW23.NCI-H520,NCW58 

EB.PC3, LnCap 

NCI-K69, NOH345, RPWE-2 

HT29,EB,A&MDA-231 

HT29,MB^DA453,Caco2 

PRSC.con, PRSCJog, RPWE-2 

PRSCjxm, NOH345, NCW345 

EB.PC3.A549 

NCW69.A549.HT29 

EB, CAUU6, LnCap 

CALU6,BT474,Caco2 

DU145, CALU6, KT29 

RPWE-2, PRSCJog, PRSC_con 

NCI-H345, NCI-H69, PRSCJog 

DU145.PC3.Hr29 

NC W69, NC1-H345, PRSCJog 

NOH69, PRSCJog. PRSCjcon 

CALU6, HT29, OVCA-R 

NCW69. PRSCJog, NCW58 

DU145,B.R»44DA231 

CALU6, DU145. OVCA-R 

NttH23,NCUi23,NCl-358 

PRSCjxm. NCI-H69. PRSCJog 

EB.&B-MDA-231.BT474 

PRSC_con, NC W69, NCW345 

PRSCJog, RPWE-2, PRSCjxm 

PC3,MB^DA435S,CALU6 

NCW045, RPWE-2. PRSCjxm 

LnCap, PC3, MB-MDA435s 

OVCArR,DU145 l Kr29 

NCI-H69, NCI-H345, RPWE-2 

MCF7.NCW23.NCW460 

CALU6,A549.MCF7 

PC3, EB, MB-MDA-4353 

218 RPWE-2, NCI-H69, PRSCJog 

NC1-H345, PRSC.bg, PRSCjxm 

A549,Caco2,PC3 

NCI-H345, RPWE-2. PRSCJog 

MaWM35s.A549.MCF7 

HT29,Efr474,NCW69 

PC3,OVGA-R.HT29 

MCT7.EB,M*MDM35s 

MB-MDA453.DU145.ffi 

MBMDA453, MB-AffiA-231, BT474 

NO«20,OVC*R.MCF7 

NCI-K69, NQ-H345, PRSCJog 

PRSCjxm, LnCap, PRSCJog 

MB-MDA-231, MCF7, NCJ-358 

NCI-H345, PRSCjcon, NOH69 

BT474.DU145.A549 

OVCA-R, PC3,MB-MDA-435s 
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334435 
330742 



334784 
337771 
300181 
300268 
309575 
336548 



CH.15JJ2 $6634760 2.16 

CH22JGB4E&385J0 £16 

AA400979 K&2S91 cafcaoim receptee nxeptnrat^ £16 

CH07_tegpS8454 £16 

CTC2JGENES.43W £16 

CH223tAC000a97.GBISCAN.119-10 £16 

AJ234955 Hs. 157568 ESTs; [Mjttusc 2.16 

AB39446 Hs^45450 ESTs 

AW168096 Hs.195188 i 



305480 
302270 



AA746500 Hs£5911 
R56151 



327015 
338576 



317253 
302301 



308417 
338177 
337592 
325945 



AW071241 Hs.199685 
R674S3 Hs.127150 

AK40693 Hs£186 



CH22J=GB£S.841_5 
CR07js $5888471 
CK(K_p2 #165182 
HLABz 
EST 

ESTsatgMon (not h UniGene} wSi exon 

CK1$Jisgp867087 

CH^1JsgP67664 

CH2^EMAC005a».GB4SCAN.429-1 

CH2*_FGBC&209_2 

ESTs 



2.16 
2.16 
Z16 
2.16 
2.16 
£16 
2.16 
2.16 
2.16 
£15 
2.15 
£15 
£15 



ESTs; WeaWy sim2ar to ZINC FINGER PROT 2.15 



CH2£_FG£NES.293^ £15 

eukaiyofic bansfafkin elongation factor 2.15 

CH22_3«AC«J5500.GENSCAN21^ 2.15 

CH22.G20H1ZGBISCAN.&7 115 

CU16J»gj$5B67138 2.15 

CH23JG0JES52O.3 £15 

CH2SJGB4ES.244J 2.15 

CH22JGBES250J5 £14 



333665 
333710 

304927 AA604728 Hs.195188 pJyajiaJashy^^ 2.14 



W32480 Hs.157039 
AA928686 



313283 
308221 
333205 

312932 AI804218 1&2Q9614 



33746 
337S64 
337984 
337704 
302162 
303192 
306200 



325409 
308558 
302185 
303021 
301005 



AF119046 
AA081755 Hs.8059 
AA926816 

AW515979 Hs.84298 

AI700145 Hs.172132 
AA243837 Hs.156915 
W39612 

AW451916 Hs£1Q848 



AA736653 

AA584712 
AW298760 



305443 
335485 
304817 
309859 
326206 

303656 AA437189 Hs.122574 
334745 
318504 



303843 
308444 
301322 



320360 
316301 
335371 
301178 
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DU145,Hra.CAUJ6 

EB,DU145,OVCAA 

EB,DU145,0VC^R 

PRSC.con, PRSCJog, RPWE-2 

NOH69, PRSCjoon, RPWE-2 

OVCA^MCF7,fi»*OA453 

C^co2,NCW58,OVCA^ 

MB-MDAr231 , RPWE-2, CALU6 

DU145, MB-MDM53, MCF7 

LnCap,EB,PC3 

NO-H345, PRSCjoon, RPWE-2 

OT474,M&MDA331,A549 

NOH69.RPW&2.MCF7 

PRSC jxxu NCVH345, PRStLtog 

NO-H345, A549, M&WDA-231 

OVCA-R. DU145, CALU6 

Nt>H69, PRSC_con, NCW345 

RPWE-2, NOH345, PRSCJog 

MB-MDA-231, NOH345, PRSC_con 

MBh\DM35s, MB-MDA-231. BT474 

PRSCjoon, PRSCJog, RPW&Z 

DU145,CALU6,S 

NOH460, PRSC.con, KCI-H23 

NOH345, PRSCjoon, RPWE-2 

EB, DU145, CAUJ6 

UCap,NO«69, PRSCJog 

PRSC jcon, RPWE-2, PRSCJog 

N0358,A549 ( HT29 

KT29,J^MDA231,BT474 

RPWE-2, PRSCjcon, MBM)A^31 

PRSCjoon, NCI-H69, NCI-H460 

EB,PC3,LnCap 

A549,NCW58,DU145 

NOH520,CAUJ6 I Caco2 

EB, A549.HT29 

PC3,MBMy<DA^53,BT474 

NCI-H69, RPWE-2, NCWH345 

MB4MDA231.HT29.cB 

MCF7, DU145.DU145 

LnCap,OVCArR.DU145 

PC3,A549,MfrMQA435s 

DU145.CALU6.Hra 

A549.CALU6, NOH23 

MCF7,KT29,A549 

Caco2.A549.MCF7 

EB,LnCap J PC3 

DU145,Hra,CAUJ6 

PRSCLcon, HOm, NOH345 

fTOC_con. RPWE-2. PRSCJog 

EB,OVCA-R,A549 

HT29,dT474,EB 

LnCap,OVCA^,EB 
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309533 AW151131 EST singleton (not in UniGene) with exon 2.03 

338579 CH22^KtAC005500.GBJSCAN.431-3 2JB 

326549 Cai9jisgip867307 2S& 

32)012 AJ628384 Hs.193745 ESTs 2.03 

334111 CH22J=GBJES330J0 2.03 

327123 Ca21J»gII6531971 2.03 

324568 AW502311 EST duster (not In UniGene) 2.03 

306012 AA896989 EST singleton (not in Unffienajwift exon 2.03 

303106 AA012877 EST 2.03 

302194 U52219 Hs.158329 Gpn^iwxnipfed receptor 50 2.03 

326646 CR20jQgij5867562 2.03 

304060 T61464 EST singleton (r>ot In UnJGene) with exon 2.03 

304667 AA535602 EST singleton (not h UnJGene) with exon 2.03 

330514 M63652 Hs33155 propenfl n P factor; complement 2.02 
310324 A1473273 Hs.159674 ESTs; Weakly sbnBarto GLUTAMATE [Rsap 2.02 

330327 CR08_p2g?5919194 2jQ2 

308447 AI6S9985 EST singleton (not in UniGene) wilft exon 2.02 

307778 A1344972 Hs231496 EST 2.02 

319459 T87351 Hs.194121 ESTs 2.02 

300935 AA513644 Hs322815 ESTs; WeaWy simfer to Wistott-AMrfch 2.02 

314318 AL037405 Hs.176141 ESTs 2.02 

334779 CH22J=GBiES.432_1 Z02 

336994 CH22J=GBJES.41(W 2.02 

334076 CH2LPGBffiS^27J1 2.02 

318116 AW452865 Hs.132339 ESTs 102 

326783 CR20Jisgi|6525298 2.02 

336142 CH22JGB4ES.705J 2J32 

320913 AA663733 EST duster (not bv UnJGene) 2.02 

301644 AW239364 EST 2.02 

300944 AW081072 Hs.164624 ESTs; WeaWy similar to Sfr3 protein [ 2.01 

310080 AW137088 Hs.144857 ESTs 2.01 

311248 A1863918 Hs.195078 ESTs 2.01 

319207 R87679 EST duster (not in UniGene) 2.01 

334760 CH22J : GBJES.428J3 2J)1 

338368 CH22JMACG05500.GBJSCAN.32&-2 2.01 

317300 AI417007 Hs.166338 ESTs ZO/I 

323699 AW178750 EST duster (not in UniGene) 2.01 

301366 AA907713 Hs^21667 ESTs 2.01 

333308 CH22J=GB*ES.137_3 2.01 

328031 CRQ5Jisgi|5902482 2.01 

301606 AA326007 Hs. 12056 asjakxjlycoprotein receptor 1 2.01 
300993 AA584930 Hs.191777 ESTs; WeaWy similar to XAP-5-S© prate . 2.01 

320042 T84520 EST duster (not to UniGene) 2.01 

331082 R17059 H&22100 ESTs 2.01 

308851 AI829820 EST singleton (not n UniGene) wifo exon 2.01 

301163 AA732066 EST 2.01 

304734 AA576428 EST singleton (rxtf in UniGene) with exon 2JM 

334855 CH22JGBIES.442J 2.01 

337121 CH2*_FGENE&519-1 ZM 

331B38 AA412498 Hs.104778 ESTs 2.01 

339181 OC2JDlA59H18.GENSCAN.72-6 2.01 

327564 CH03J»gip867811 2.01 

304108 R63932 H&28467 .EST 2 

3150% AA534953 Hs. 163297 ESTs 2 

312777 W92809 Hs.138557 ESTs 2 

3056% AA868536 Hs.126145 EST 2 

323185 R52177 EST duster (not In UniGene) 2 

308681 AI761307 EST singleton (not in U niGene) wffii exon 2 

325755 CH.14Jtsgif6682474 2 

324376 AW499705 EST duster (not in UniGene) 2 



331890 AA432166 H&3577 succinate dehydrogenase canptex; subunlt 2 



r\©-m-231,BT474, LnCap 
NO-H69, NO-H345, RPWE-2 
NC^H69.Caco2 f NCW345 
BT474, MB-MDA-453, MCF7 
NO-H69, M&MDA-231, BT474 
NOH345, NO-H69, RPWE-2 
NO-H345, NOH520, NCHH460 
NO-H69, PRSCJog, PRSCjan 
RPWE-2, OVCA^EB 
NCm520.NCRG3.PC3 
NCR4460, OVCA-R, KT29 
NO-H345, PRSCjcon, PRSCJog 
A549.DU145,EB 
NCW23.NO-H460.NCW58 
NCW345,MB4fflA-231.BT474 
NO-H345, NCI-K69. PRSCJog 
NCJ-H345, RPWE-2. PRSCJog 
NCW69.GALU6.OVCA* 
NCI-H460, NCW58, NO-H520 
DU145, EB, OVCA-fl 
PRSCLcon, LnCap, PRSCJog 
EB.HT29.DU145 
NC W345. PRSC.ccn, NCI-H69 
OVCAR.CALU6.ffi 
MB4y©A-231. NCt+i69, NCI-H345 
NCW69. PRSC jan, RPWE-2 
NCI-H69, PRSCJog. PRSC_con 
DU145, EB, CALU6 
PRSC.ccn, RPWE-2. PRSCJog 
RPWE-2. NOH69.NOH23 
PRSCjwn, NCI-H345. PRSCJog 
NCI-H345, NCI-KKS, RPWE-2 
KT29.A549.NCW460 
NCW58.NCW69, PRSCJog 
NO-H23. NO-H520, NOH460 
NCW460.DU145.NOH23 
MCF7, MB-MDA-453, OVCA-R 
PRSC.ccn, NCI-H345, RPWE-2 
NCJ-H69, NCI-H345, PRSC.ccn 
MBMDA-231, NCW345, PRSC_oon 
MB-MDA453.DU145.EB 
HT29.NCW23.NCU58 
PRSC.ccn, NCI-H345, NO-H69 
EB.DU145, MB-MDA-435S 
DU145.ffi.PC3 
OVCA-R, PC3, MB-MDA-435S 
U)Cap,MB^DA453,DU145 
NC W345. RPWE-2. PRSCJog 
NCI-H69. NCI-H345, PRSCjcon 
BT474.BT474.MCF7 
NO-H345, PRSCjxm. NOH69 
BT474.KT29.DU145 
BT474. OVCA-R. MCF7 
MWICM35S. M&WDA453, LnCap 
PRSC.ccn, NOH345, MB-MDA-231 
KT29.HT29.BT474 
EB.A549..BM74 
RPWE-2. PRSC.con, NC W345 
NCW345. PRSC.con, PRSCJog 
DU145.BT474.PC3 
CALU6.M&MDA453.A549 
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Table 4 

Pity* Uixmjdc Bos piobcsci idoxtifxcr uom^cf 

UmgcodD: Unigcnc number 

Ucigcne Title Umgcnc gjene tiUe 



Pkoy ExrJUxn UniGJD ComptetB_rrtte 



Ratio teet/BS Top 3 expressing cell fines 



313166 AI801098 Hs.151500 
334593 

331084 R20655 Hs£1281 
324598 AA5Q2659 Hs.163986 
314071 AA192455 Hs.188690 
315178 AW362945 Hs.162459 



H58097 Hs.161023 
AA533447 



AA378739 

AA731209 

AKJ93177 

AA41BS87 

AA421163 

H40388 

AF086372 

AA582082 

AA565051 

AW271974 

AW292247 

AI732331 

AA1356Q6 

AW162263 

A1539443 

AA086110 

AA135159 

AA078493 

H01560 

A1865455 

AA311443 



331433 
315021 
337635 
324048 
300781 
320701 
332471 
331858 
330987 
322309 
324733 
313577 
310966 
311332 
314522 
330886 
313597 
314439 
320807 
311804 
321354 
325169 
312828 
321226 
327772 
315642 
311905 
312754 



Hs.134923 
Hs.120380 
Hs.163848 
Hs.131965 

Hs.199410 
Hs.155029 
H&21Q295 
HsJ255Q52 
Hs.167750 
H&189384 
H&249990 
Hs.137447 
Hs.188536 
Hs^03349 



ESTs 1123 

CK23J=GENES.408_3 8.06 

Human done 23732 mRNA;paifialafe 7.89 

ESTs 7.77 

ESTs 7.76 

ESTs 081 

CR12_h3gp)17Q36 634 

EST 016 

EST cluster (not In UruGene) 6.15 

CH2^^AAC000(S7.GB4^AN34-1 SM 

EST duster (not in IMGens) £77 

ESTctu3te{notlnLlnfGene)wffliexonh 5.72 

ESTs 5\68 

nudear receptor co-represser 2 5X8 

ESTs £66 
ESTs535 

ESTdusier(notinlhtf5ene) 531 

ESTs 5.17 

ESTs 5.16 

ESTs 5.15 

ESTs 5JJ5 

ESTs; Moderately similar to IU I ALU CLA 5.04 

ESTs; WeaWy s!m3ar to UN ALU SUBFAM1 4.93 

ESTs • 4.84 

ESTs 4.84 

H sapiens done 24838 mRNA seq 4.83 

ESTs 4.82 

EST duster (not in UnfGene) 4.81 

ESTs; WeaHy similar to IUI ALU SUBFAM1 4.8 

ESTs; Moderately similar to 01! ALU SUB 4.78 



NCW345, OVCfcfl. LnCap 



CaGo2,EB, OVCfcfl 

OVCA-R. OVCfcfl 
LnCap.OVCA-R.ffi 
OVCfcfl. B3.CALJJ6 
CALU6, EB, DU145 
OVCA-R, S.CALU6 
NCW69, NCW345. PRSC.con 
OVCA-R, A549, EB 
PC3, EB.CALUS 
NCW69, NCtt#45, DU145 
OVCA-R DU145, EB 
MB-MDA-453, MCF7, MB-MDA-435s 
A549.NCW345.NCW69 
LnCap, A549, OVCfcfl 
OVCAfl,DU145,Caco2 



Hs.163813 
HS211818 

Hs£51416 H sapiens mRNA; cDNADKFZp586E2317(from 



CH.05JisgIj5867964 
AA742222 Hs.120634 ESTs 
AA555215 Hs.151913 ESTs 
Hs.250383 ESTs 

CH22J=GBES.13-7 
331644 TB9544 Hs.173734 ESTs; WeaJdystmfar to Ofl ALU CLASS B 
CHZLFGBCS.401-2 
ESTs 
ESTs 



316261 
300417 
300610 
324718 
332170 
324042 
331148 



AW134405 
AW139492 
N72598 
AB57019 
F04112 
AA377589 

R73816 Hs.17385 



4.74 
4.7 
4.7 
4.59 
4.58 
4.55 
4.55 
4.53 
4.52 



Hs.144967 
H&245887 

H359120 DEAD/H (Asp-Qu-Al8rAsp/hfe) box poJ)«^ 452 

Hs.116467 

Hs.177178 



321920 N63915 

320832 AA214584 

321971 A1680459 H&201441 

308572 AI707882 

302459 AF169255 

321847 T08401 

337884 

307494 AI269188 Hs.175555 
314915 AA573072 Hs.187748 



319379 
312332 
331445 
315841 
315712 
319559 
300791 



791443 

R33041 

H69093 

AVV13S397 

AI950133 

AA773876 

AL138455 



Hs.193963 

Hs. 106200 

Hs.41215 

HsJ247572 

Ks.120882 

KsJ251597 

Hi2ffi135 



ESTs 4.5 

ESTs 4.47 

EST duster (not In UniGene) 4.45 

ESTs 444 

CH.09_hsgIj5888527 4^43 

ESTdustef(nctbiUnlGefie) 434 

ESTdustar(notbiUni6ene) 4J34 . 

ESTs 433 

EST singleton (not In UniGene) with exon 433 

EST cluster (not in UniGene) wfih exon h 4.28 

EST duster (not m UnJGena) 4.25 

CH22_EMAC005500.GENSCAN^4-2 4.23 

EST 4.23 

ESTs; WeafysMarto III! ALU SU8FAM1 4.21 

CH22JSBffiS.1*2 4^1 

ESTs 42 

ESTs 4.19 
ESTs • 4.19 

ESTs 4.19 

ESTs; \tooeiate!y similar to Oil ALU SUB 4.18 

ESTs 4.15 

ESTs; Moderately simitar to ltfl ALU SUB 4.13 



OVCArR.DU145.PC3 
PRSC.con. PRSCJog, NOH345 
OVCfcfl. PC3.EB 
NCW69, PRSCJog, PRSCjoon 
Caco2.OVCfcfl.B3 
EB, DU145.HT29 
OVCfcfl, DU145.Caco2 
NCLH460, NC W345, NCU423 
DU145,Caco2.MBM)fc231 
PC3, OVCA-R. DU145 
OVCA-R, PC3, Caco2 
DU145, EB, OVCA-R 
NCI-H345. DU145, UCap 
DU145,DU145,DU145 
4.75 DU146\ OVCfcfl, N&MDA453 
HT29, MB-MDA-231, NO-H345 
DU145, S, M&M3M53 
DU145, Caco2, PRSC_con 
OVCfcR.PC3.EB 
NCW69, PRSCJog. NO-H345 
OVCfcfl, NOH345,Caco2 
Caco2, Caco2, EB 
NCW460,NWi345,Caco2 
DU145, CALU6, B 
OVCfcfl.PC3.EB 
LnCap, PC3, PRSC_ccn 
Caco2,DU145,DU145 
NC W345. PRSCjoon, PRSCJog 
CALUS, OVCA-R. EB 
HT29,BT474,NOH69 
Caco2.A549.A549 
NCW23.CALU6, OVCfcfl 
DU145, HT29, CALU6 
MCF7.NOH345, OVCA-R 
MB^DA-231, OVCA-R, LnCap 
MB-MDA453. M&MDfc435s. MB-MDA-231 
HT29, NOfl23, MBWfc435s 

NO-H520.NCW58 
PC3,OVCfcfl,Caoo2 
NCI-H69, NCW345, PRSCJog 
PC3. OVCfcfl, LnCap 
NCI-H69, OVCA-R. NO-H4S) 
EB.HT29.DU145 
Caco2.MB^lDfc453, LnCap 
LnCap. NOmte, OVCfcfl 
NCH045. 03002, DU145 
NCI-358, RPWE-2, MCWH460 
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312129 AW300867 EST duster (not in UniGene) 4.12 

321166 AA411263 Hs.123783 ESTs 4.11 

313220 AI971981 Hs.118241 ESTs 4.1 

314022 AW452420 Hs.248678 ESTs 4.1 

321359 AW474412 EST duster (not in UniGene) 4.1 

328841 CK07Jisgj|638192O 459 

337698 CH2^_0VtAOH)55OO.GBJSCAM5^5 459 

333245 CH2ZJ ? GENESL115JJ 4.09 

311958 A1247472 Hs.132965 ESTs 4.06 

314775 A1149880 Hs.188809 ESTs 4.06 

317901 AW1 50944 HsJ250541 ESTs 4.06 
309985 AW452919 EST singleton (not in UniGene) wfthexon 4.05 
311004 AA632846 EST duster (not in UniGene) 4.05 
323497 AI523613 Hs^21544 ESTs 4.04 
332347 W60326 Hs.221716 ESTs 4.04 
331388 AA456852 Hs.43543 suppressor of whte apricot homotog 2 4.01 
313197 AI738851 t&222487 ESTs 3.96 
315710 AA931550 Hs.192785 ESTs 335 
316897 AAB38114 EST duster (not fn UniGene) 354 
322564 W86440 Hs.118344 ESTs 354 
304605 AA513225 EST singbton (not In UniGene) with exon 35 
325726 CH.14Jtsgi)6552447 35 
320190 R32047 Hs.141012 ESTs; WeaWy similar to HQ ALU SUBFAM1 359 
331566 K63062 Hs.48703 EST 357 
319403 T98413 EST duster (not In UniGene) 356 
324643 A1436356 Hs.130729 ESTs 354 
315298 AJ969314 H&211377 ESTs 352 
321632 AA419617 EST duster (not in UniGene) 351 
313219 K74924 Hs.182099 ESTs 35 
330833 AA046804 ESTs; Weakly simfer to M ALU SUBFAMJ 35 
327289 CR01Jisgij5fi67481 3.79 
314429 AW300749 EST duster (not In UniGene) 3.79 
314475 AI911160 Hs.127505 ESTs 3.79 
317130 AW293995 Hs.192277 ESTs 3.78 
336635 CH22_FGENES.13-5 3.77 
333323 CH2^JGENES.138J6 3.76 
332135 AA620331 Hs.245351 EST 3.75 
316979 AA8610S7 EST duster (not in UniGene) 3.75 
316435 AI671871 Hs.192618 ESTs; WeaWy similar to fill ALU CLASS C 3.74 
315422 AW135357 Hs.192374 ESTs 3.73 
336616 CH22_fGENES.613_5 3.72 
32028 W93241 EST duster (not in UniGene) 3.71 
300463 N52510 Hs. 186470 ESTs 359 
306881 A108B695 EST singtetan (not in UniGene) with exon 358 
337304 CH22JGENES581-6 357 
323693 AW297758 Hs.249721 ESTs 357 
331073 R07998 Hs. 18628 ESTs; WeaWy similar to lili ALU SUBFAMJ 357 
318162 AW296277 Hs.132171 ESTs 357 
318042 AW294522 Hs.149991 ESTs 3.68 
308069 AI470895 EST singleton {not in UniGene) with exon 354 
327614 CH.04_hsgIj6525283 3.62 
337514 CH22J=GENES5C»-7 352 
332083 AA608794 Hs.112592 ESTs 3.6 
327793 CH.05J»sgif5867979 359 
331053 N70242 Hs.183146 ESTs 359 
303769 AA134688 Hs.173415 ESTs 358 
319872 R97130 Hs.189689 ESTs 358 

317902 AI828602 Hs*11265 ESTs 3.57 
324090 AI656531 Hs.116070 ESTs 357 
300120 AW204314 Hs.170784 ESTs 357 
307752 A1339447 EST singleton (not in UniGene) with exon 356 
322438 W44531 Hs.167851 ESTs 355 
311275 AI659166 Ks.207144 ESTs 355 
338830 CH22JXJ246D7.GENSCAN57 354 
315647 AA648983 H&212911 ESTs 353 
331469 N22273 H&39140 ESTs 352 
313445 AI123657 Hs.127264 ESTs 351 
330139 Cft21_p2gq4210430 35 
304450 AA404521 Hs.10326 coatomer protein comptex; subunit epsilo 3.49 
325763 CK14J* gp82475 149 
312803 AA677934 Hs.117864 ESTs 347 
303654 AA436942 Hs.168308 ESTs 346 
317924 AI222324 Hs.166306 ESTs; Wealdy surfer to zinc finger pot &46 
312354 AA036955 Hs.167040 ESTs 3.44 
337517 CH22J=GB*ES51±6 3.43 
324865 AA7Q2138 Hs.114103 ESTs 3.42 



OVCArRMCF7,A549 

OVCMlCacQ2.PRSC.oon 

OVCMlDU145.Caco2 

OVCA-R EB.PC3 

DU145.OVCMl.PC3 

NOU69, PRSCJog, NCW345 

NCWB45,NCW69,OVCA* 

PRSCJog, PRSCjoon, NCW345 

EB, DU145, CALU6 

OVCA-R, PC3.EB 

BT474. M&MDA-453, MB-MDA-435S 

MB-MDA453, NCLH23, NCI-H52Q 

MB-MDA-453, OVCA-R, EB 

LnCap, OVCA-R, EB 

EB.CALU6.PC3 

A549,EB,Caco2 

OVCA-R.ffi.PC3 

EB. MB-ffflA-231, OVCA-R 

OVC/VK.A549.MBWA453 

NaH460.Caco2.ra 

NCW345,RPWE-2,BT474 

OVCA-R, LnCap, LnCap 

DU145, NCJ-H23, PRSCJog 

NOH23, NCI-H460, NCL358 

NCVH345, PRSCJog, LnCap 

OVCA-R. DU145, NCI-H345 

NCI-H345, PRSC.ccn, PRSCJog 

EB.OVCfcfl.A549 

EB,Caco2, OVCA-R 

UiCap.DU145.PC3 

EB.KT29.DU145 

OVCAtf,PC3.PRSC_con 

DU145.CALU6.NCI4<69 

EB, PC3,Caco2 

NCW69, NOH345, PRSCJog 

NCI-H460, NCI-H23, PRSC.con 

NCm345.A549 t Caco2 

NCW345, NCJ-H69, RPWE-2 

MB-MDA-435S, MCF7, kffi-MDA-453 

OVCA-R A549.EB 

NQI-K69, NCI-H345, RPWE-2 

MB-MDA-231.NCW69, EB 

OVCA-R. A549, DU145 

CALU6, HT29.EB 

MCF7.MBM5M53. LnCap 

OVCA*, M&MD/U53, DU145 

RPWE-2, NOH345, OVCA* 

MB-MDA-231, DU145, CALU6 

EB, HT29, CAIJU6 

C3co2,Caco2,NClH23 

NOH460. NOH345, NO-H69 

NCK558, NOH23. NOH460 

EB, OVCA-R, DU145 

LnCap, OVCArR.3 

OVCA-R, EB, Caco2 

HT29, CALU6, CALU6 

PRSCjan. LnCap, RPWE-2 

CALU8,NCi«45.0VCArR 

PRSCjoon, NOH345. PRSCJog 

NC1-H69, NOH345, PRSC_ccn 

N0358, HT29, MBWDA-231 

Nd+B45,Nf>H69,Caco2 

IvS-MDA-231, PRSC.con, LnCap 

LnCap, PC3, OVCA-R 

OVCA-R, MB-MDA-453, CALU6 

EB, A549, CALU6 

EB, OVCA-R, A549 

EB.CALU6.DU145 

NCW4345, NCI-H69, NCW4460 

PC3, BHT474, OVCfcR 

OVCA-R, Caco2, MB-MDA-453 

DU145,NOH460,NCW69 

PRSC_con, PRSCJog, NCI-H69 

Caco2, MB-MDA-435s, NCI-H460 

NCWi69.HT29.PC3 

NO4Q3.NOH460,NCW520 
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323755 AW300094 EST duster (not fcUriGens) 3.42 

314452 AL042699 H&20S222 ESTs 3.42 

337911 C^22_0itAOX£5CHXGENSCAR59-6 3.42 

318088 A1025499 Hs.132233 ESTs 3.41 
31189 AA704705 Ks.181044 ECTgW^»stafertaOianA;HumanO 

Canpiuxed W3n L-CanaDne [Rsapiens] 3.41 

314409 H15560 Ks.131833 ESTs 3.41 

323333 AA228883 EST duster (not In UniGene) 3.41 

32690 CR14Jisgij58S7021 14 

314539 AA398216 Hs.190092 ESTs 14 

310567 AI691065 Hs.155780 ESTs 3.4 
33J5Z7 S77356 transcript ch21=o5c^xnytin sertsShriiy c 

8 stomach cancer cefl Ones, mRNA, 262 n 33 

31468) AA436007 Hs. 188780 ESTs 139 

321321 AB033072 EST duster (not In UniGene) 139 

32335B AA234C09 Hs.188715 ESTs 338 

328592 Oi07Ji3gP68227 338 

311116 A1S31195 HsJ232193 ESTs 136 

323853 AA393460 EST duster (not in UniGene) 336 

327740 CUQSJis g£867943 335 

326857 CH20_hsgq6552460 133 

317787 AW339612 H&249364 ESTs 131 

325760 CK14J»gi|6552449 13 

337513 CH23JH3ENE&8Q&4 129 

336606 CH2^J=GENESj429J3 13 

322895 AW470295 Hs.192152 ESTs 13 

314312 AA814971 Hs^57634 ESTs 129 

328224 CK08_hsgp68101 128 

336128 CH22_FGENES.701_16 127 

332442 AA281323 Ks.4947 ESTs 127 

302514 M14269 EST duster (not to UniGene) w2h exon h 127 

313749 AW450376 Hs.130803 ESTs 126 

302891 AI681578 Hs. 114154 ESTs 126 

334690 CH2£JGBES.420__3 125 

308676 AI761036 EST singteton (not in UniGene) wffiiexon IS 

304254 AA046273 Hs.1 11334 ferrittn; Bght pofypepfide 324 

311994 AA648314 Hs.13849 ESTs 124 

321020 AB023170 HsJ227850 K1AA0953 protein 124 

316724 AA810788 Ha 123337 ESTs 123 

326942 CR21Jwgij6004446 122 

324824 AI826999 H&224624 ESTs 121 

320789 R78712 EST duster (not In UniGene) 121 

315070 AW131368 Hs.186736 ESTs 121 

303794 AW2419S7 Hs.197025 ESTs 119 

310237 AI884313 Ha 158306 ESTs 119 

313960 AA13Q859 EST ckister (not in UniGene) 118 

336634 CH22J=GBE&13-4 118 

301085 AA779G58 Hs.190428 ESTs; WeaMy skniar to NG26 [hLsapjens] 117 

313774 AW136836 Ha 144583 ESTs 117 

307177 AI188864 EST singteton (not in UnGene) wtfo exon 117 

324025 AI174861 Ha 190623 ESTs 117 

313099 AB07359 Ha 128064 ESTs 117 

305536 AA770682 EST shgteten (not tn UniGene) with exon 117 

331916 AA446131 Hs.124918 ESTs 117 

314912 AI431345 Ha1617B4 ESTs 117 

303388 AL03S604 EST duster (not in UniGene) vft exon h 117 

332273 R05818 Hs.173830 ESTs 116 

314697 AWD88739 Ha243770 ESTs 116 

335344 CH22J=GENES.538_3 115 

326162 CR17_bsgtj5867168 115 

304467 AA424703 EST singleton (not in UniGene) w8h exon 115 

339340 CH223A354I12.GENSCAN27-8 115 

325393 CH12Jisgp86©21 113 

315367 AA732484 Ha1 69393 ESTs 3.13 

307085 AI16Q868 EST singleton (not In UniGene) wffli exon 112 

313001 N29264 Ha249591 ESTs; Moderate^ similar to III! ALU SUB 112 

337605 AI290006 EST singleton (not fn UniGene) wfih exon 112 

325710 CR14_hsgI)$682473 109 

313810 AA400079 Ha257854 ESTs 109 

335482 CH22J=GENES570J1 109 

326310 CR17JtsgiJ5867277 108 

325742 Cai4Jisg96552448 108 

312467 AI241809 Ks.75458 rftosomaJ protem L18 108 

327309 CrU)1JisgiJ6456757 107 

310583 AW205632 Ha211198 ESTs 107 

322373 W25673 Ha130829 ESTs 107 



PRSCjx*. RPWE-2. NCWC45 
NOH345. PRSCjav PRSQJog 
OVCM*.PC3,HT29 
C^UI6,LnCap,OVCA-R 

LrCap T M&A5DA435s,A549 
NCUi69,lrCap,LnCap 
Gffl*. OVCA-R, NCUE9 
HT29.CALU6.DU145 
MB-MDJV231,BT474,S 
PRSC_con, NOH345, NOK69 

NCW23.Caco2.A549 

OVCA-R. BT474,Caco2 

NCU58.EB.Caco2 

DU145, CAUJ6, CAUJ6 

MCR,N035B,M&tfDA431 

NOH520, NCW23, PRSQJog 

DU145,3,Caco2 

EB, LnCap, OVCA-R 

NCW69.MCF7.NCW345 

NCW345, PRSC.con. PRSCJog 

EB.CALU6.HT29 

Lr^NCRf23,NOH460 

NOK69.A549.NCW3 

DU145. Caco2, EB 

RPWE-2.NOH69.NOH345 

DU145 f NCW345,LnCap 

BT474, NCHJ520, M&MJA-231 

taco2.PC3,NCI-H345 

DU145,CALU6,NCi-H52D 

OVCA-R, NOH69.DU145 

LnG^NCW^ PRSCJog 

NCI-H69, RPWE-2, PRSC.con 

DU145.MBiyiDA231.HT29 

OVCA-R.Dm45.A549 

NOH460, NCKH23. MB-MQA-453 

EB. MCF7. MB-MQA435s 

DU145. OVCA-R, BT474 

HT29.BT474.NOH23 

OVCA-R, MB-MDA453, EB 

DU145.LnCap.EB 

Caco2. NCW58, NO-H460 

OVCA-R, PC3,LnCap 

Na358.NOH345.MCF7 

MBWDA531.HT29,Br474 

NCW«9.Na-H345.BT474 

NCI-H345, NC W345, NCW58 

Caco2, EB, OVCA-R 

EB. CALU6, CALU8 

OVCA-R, DU145.PC3 

M8-MDA-231, BT474, EB 

N0358, 03032. HT29 

EB. OVCA-R, Caco2 

EB. BT474, MCF7 

KT29, NCW58.Caco2 

MCF7, DU145, EB 

NB-MDA453.DU145.MCF7 

PRSCJog, NCW345. PRSCcon 

BT474, Hra,riT29 

NOH23, RPWE-2, NO-B460 

LnCap, OVCA-R, MB-MDA453 

Caco2,NaH23.NCW58 

OVCA-R, EB, MB-MDA-453 

RPWE-2, PRSCjan, PRSCJog 

NCW345, OVCA-R. Caco2 

MBM5A231.HT29.NOH23 

NOH69. »»WDM53, BT474 

EB.DU145.CALU6 

NCW460 f NO-358.SaH23 

MCF7.ASWM51PC3 

NCW23.NaH460.HT29 

NCW58.NCW23.NCI4Ma) 

NC1-H69, MBMDA^35s, MBMDA435S 

OVCA-R. A549,Caco2 

NOH59, PRSCjsn, NOH345 
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324497 AW152624 Hs.136340 ESTs 106 

315(85 AAS31815 H&243788 ESTs 3.06 

302445 N79647 EST cluster {nolInUnIGen8)wi8»e«Hih 3.05 

302842 AW383228 Hs.163834 ESTs; Highly slmferto Chp [Rncjvegfco 3.05 

317346 AA952H75 H&221274 ESTs 334 

334650 CH2U=GBE&417 - 17 334 

308644 A1002913 EST singleton (not h UnlGene) with axon 104 

322682 A1110679 EST duster (not In UniGene) 3.03 

311065 AW204582 Hs224905 ESTs 3.03 

318623 AA355439 Rs.151547 ESTs 3.03 

304978 AA617735 EST singleton (not In UnlGene) wffli axon 103 

305554 AA774567 Ha.121774 EST 3.03 

302574 U66199 H&249165 fibroWast growth factor 11 3.03 

336212 CH22_FGENES.719 6 3.02 

302893 AL1 17539 Hs.173515 H sapiens mRNA; cDNA DKFZp58SH021 (from 

315166 AJ343966 Hs.158528 ESTs 3.01 

335608 CH2*J=GENES.58^3 3.01 

330028 CK17_p2gi|6634847 3.01 

303179 AA071215 EST duster (not In UnlGene) vffiiaconh 3.01 

307625 A1299617 EST singleton (not in UnlGene) wifft axon 3 

323074 AL119445 H&203213 ESTs 3 

336232 CH22JH5ENES736_7 3 

334915 CH22LFGaJE&457_4 3 

329116 CKXJisgii5868650 3 

333495 CH22J=GENES.168 5 3 

303756 AI738488 Hs. 11 5838 ESTs 239 

332134 AA610123 Hs.139240 DKFZP564F1422 protein 2.99 

322916 AW367294 Ha154091 ESTs 2.99 

31B050 AKJ52093 Hs.133132 ESTs 239 

301019 AI147356 Hs.98722 ESTs 239 

315213 AA587773 Ha 136494 ESTs 238 

339251 CH22_BA354l1ZGENSCAN7-5 238 

303835 T05645 EST cluster (not In UnlGene) wahexonh 2.97 

300070 AI174603 H&256832 ESTs 237 

320954 AB028953 H&204121 WAA1030 protein 237 

327624 CK04J»gip867871 2.97 

329029 CRXJw 9^6525302 236 

317040 AA868584 Hs.126154 ESTs 236 

328016 CH36JB g?5902482 236 

312674 AI762475 Hs.151327 ESTs; Moderately slmlar to till ALU SUB 236 

332301 R70253 Hs. 127826 ESTs 236 

300351 AI732374 Hs.105834 ESTs; WeaWy slmferto 25 kDa trypsin I 235 

318226 AKJ78446 Hs.134125 ESTs 235 

311349 AW292933 H&254110 ESTs 234 

312757 AI285970 Hs.183817 ESTs 234 

316507 AI381515 Hs. 158381 ESTs • 234 

302278 AF018080 Hs.173730 Mediterranean fever 233 

311016 AW173166 Ha243468 ESTs 233 

323884 AA340724 Hs314028 ESTs 232 

336632 CH22J=GENES,13-2 232 

328886 CH37Js g^588003 232 

301859 761587 EST duster (not in UnlGene) wffli exon h 232 

323775 AA329856 Hs.143022 ESTs 232 

315426 AI391486 Ks.128171 ESTs 232 

322264 AFQ86242 EST duster (not fn UniGene) 232 

315135 AA627561 Hs. 192446 ESTs 231 

327982 CH36_hsgf|5868216 231 

314530 AKB2358 Hs.131741 ESTs 231 

315003 AA527650 Hs. 156037 ESTs 23 

339032 CH22JDA59H18.GENSCAN25-1 23 

308379 AI623950 Ha2186 eukaryotic translation elongation factor 239 

312133 T87714 HsJ221665 ESTs 238 

307992 AI434166 EST singleton (not fn UniGene) wffii exon 238 

308010 AI439190 Hs. 181 165 eukaryofic translation ekmgatlon factor 238 

320154 AA336019 Hs. 11 9559 ESTs 238 

331496 N34929 Hs.171984 ESTs Z86 

320016 H57622 Hs.194574 ESTs 236 

317923 AW450544 H&220751 ESTs 286 

301822 X17033 Hs.1142 tntegrin; alpha 2 (C049B; alpha 2 sutxini 236 

311759 AA7O5075 Hs.169536 Rhesus blood group^associated giycoprote 235 

315083 A1221325 H&21Q655 ESTs 234 

317759 A1908455 Hs3Q2460 ESTs; Weakty similar to hypothefeal L1 233 

313980 A1633205 H&159914 ESTs 233 

310941 AI4534Q2 Hs.173705 ESTs; Weakly similar to QD ALU CLASS C 233 

313593 A1911488 Hil213724 ESTs 233 

314973 AW273128 H&254669 EST 232 



NOH345, RPWE-2, PRSC_con 

Caco2.DU145.EB 

OVCA-R,A549,NaH460 

A549, DU145,NCWi23 

BT474.HT29.Hra 

MCF7, BT474, OVCA-R 

GALU6, MCF7, BT474 

NCtH345.RPVtE-2.OVOW* 

PRSCJog, PRSCjon, NCWi460 

DU145.MaWDA435s.Hr29 

CALU6. BT474. M&WDA-435S 

EB. NCWM60.Caco2 

HT29.DU145.PC3 

NOH69,NOH23 ( NC14i23 

3.02 EB. DU145, CALU6 

Caco2,EB.NCI-H69 

NCI-H23, NO-H520, NOH345 

OVCA4lHT29.LjiCap 

MCF7, RPWE-2, MB-MDA-453 

MB^DA-231, LnCap, BT474 

NCW23.NOH520,NCI-H460 

HT29.BT474,WBWnAr231 

NCLH345, PRSC.con, NCW69 

NC1-H69, PRSC.con. RPWE-2 

OVCA-R NOH69, NCI-H345 

HT29,PRSC_can,DU145 

EB.A549.MCF7 

DU145.DU145.OVCA* 

NCI-H345, DU145, NOH52D 

NCU58, NC W68. MB-MDA435S 

MW«)A-231,BT474 l LnCap 

NCW*9, PRSCJog, HT29 

BT474.NCW345. LnCap 

DU145.A549, OVCA-R 

LnCap.DU145.PC3 

EB, DU145, LnCap 

NOH69. PRSCJog, LnCap 

DU145.EB.LnCap 

NCW345, PRSC_con, DU145 

OV<a^NCI-H69,NCI-H69 

OVCA-R, DU145, MBM)A-231 

NCk358,NCI-H460,Caco2 

NO-H460.NOH2&NCU58 

EB, DU145, OVCA-R 

DU145.UCap.LnCap 

PRSC_con, PRSCJog, RPWE-2 

S.NO-H69.DU145 

NCI-H345, LrrCap, LnCap 

EB.Caoo2.HT29 

NCI-H69, NOH345, MB-MDA-231 

HT29, PC3, UCap 

LnCap, S,EB 

PRSCjoon, PRSCJog. RPWE-2 

CALU6, EB, A549 

Caco2. OVCA-R, DU145 

EB.HT29.DU145 

LnCap. MBMJA453, NO-H69 

NCI-H460, NCI-H520, RPWE4 

PRSCjcon, RPWE-2, MB-MDA-231 

NCI-H69, PRSC_con, RPWE2 

BT474.MB4in2A231.HT29 

Cacc2.MBMMr453.MCF7 

NCW520,MCF7.NCtH23 

Caco2.NCl-H69.NCW345 

MBM)A453.DU145,EB 

WB-MDA-453, PC3, MCF7 

PRSCjcon, RPWE-2, PRSCJog 

NCW345, PRSCjan, PRSC Jog 

PC3, BT474, CALU6 

DU145,riT29,MB-MOA-231 

PRSC_ccn. RPWE-2, NCW345 

HT29.MBWDA231.BT474 

Caco2.MBM2rV453.A549 

NOH345. MCF7,Caco2 

UCap.Caco2.NCW460 

BT474, LnCap, RPWE-2 
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310960 AJ58275B Hs.170561 ESTs 182 

323826 ALD39822 H&207&4 ESTs 182 

325410 CK1^hsgip8ffi921 181 

313911 AE65458 Hs.116385 ESTs 231 

334244 CH22JSggS.36S_3 181 

309333 AWQ25709 ESTshgJrian(notbUnK3era)wffliexOT 181 

328467 m07JiSQjl5868434 181 

318563 AW250501 EST duster (nrt In UnKSene) 181 
326412 Cai9jBgip867362 • 1B1 

303407 AA3G9616 EST duster (nd to UniGene) wfoexenh 18 

328462 CR07Jis 9^5868433 IB 

335157 CH22JFGB1ESJ3MJ 18 

313458 AAD07259 Ha255853 ESTs 179 

310416 AB95047 rfa202395 ESTs 179 

317709 AM35973 Hs.128056 ESTs 179 

321415 AG77596 Hs3337 faansmenibane4supertan%nmber^ 179 

313693 AW469180 Hs.170651 ESTs 179 

309438 AW1Q28Q2 H&225787 ESTs; Morferate^sta2artohypoffi^al 179 

308961 A187Q248 EST sfngfeton (not to UniGene) wfo exon 178 

329107 CHXJisgij5868626 178 

313975 AW025024 Hs.65114 terafin18 178 
330901 AA157818 H&238380 Human endogenous retroviral protease mRN 178 

311749 R06249 Hs.13911 ESTs 178 

329853 CR14_p2 $6682295 178 

322340 APQ88076 EST duster (not in UniGene) 177 

326806 CH20Jis^6469835 177 

314661 AA436432 EST duster(not to UniGene) 177 

322135 AF075082 EST duster (not in UniGene) 177 

331849 AA417078 Hs.193767 ESTs 177 
301056 AI797955 Hs^08076 ESTs; WeaWy similar to D(4) DOPAMINE RE 176 

327739 CH.05Jisgij5867942 176 

308016 AI445116 EST singleton (not in UniGene) tfBi exon 176 

331549 NSKffi Ha237507 EST 176 

331851 AA418599 HsJ8303 caveofin3 175. 

315023 AA5335Q5 Hs.185844 ESTs 175 

335565 CH22J^BJES.579J 175 

306137 AA916176 EST singteten (not in UniGene) wilh exon 174 
332240 N54803 yv31dls1 Scares fetal Over spleen 1NFL 

3T smQar to contains L1J3L1repeta 174 

313246 N90762 Hs.158454 ESTs 174 

303642 AW299459 EST duster (not to UniGene) wffii exon h 174 

325513 CK1OsgPM7035 174 

337236 CH2*JGENES£392 174 

311555 AW407892 Hs2448U7 ESTs 174 

339266 CH22JBA354J11GENSCAN.104 173 

300127 AW028615 Hs235224 ESTs; WeaMy stoiSarto K1AA0422 (Rsapi 173 

311741 R00099 Hs.193642 ESTs 172 

310915 AW449673 H&2D1893 ESTs 172 

324982 T31689 Hs.98518 ESTs 171 

305030 AA629988 EST singleton (not in UniGene) wilh exon 171 

315396 AW296107 Hs.152686 ESTs 169 

319098 AG08374 EST duster (not to UniGene) 169 

309119 AI9Z7384 Hs22B499 EST;MooeratB^sIntotoPK-120preoiT 169 

312095 AW444937 Hs233482 ESTs 168 

324316 A1291330 EST duster (not in UniGene) 168 
331367 AA425688 Hs.41641 ESTs; WeaWy simferto CAGH4 [Ksaptens 168 

339116 CH22_.DA59HiaGENSCAN.4M 168 

324297 AI565566 Hs.168587 ESTs 168 

318728 Z3Q201 EST duster (not to UniGene) 2£& 

304813 AA584540 EST singleton (not to UniGene) with exon 168 

312393 mm Hs.191659 ESTs; WeaHy simitar to Iffl ALU CLASS E 168 

330671 ABQQ2302 H&92236 WAA0304 gene product 167 

3)5406 AA723860 EST stogleton (not to UniGene) with exon 166 

330957 H08778 Hs.133521 ESTs 166 

300350 AJ871129 Hs.172597 ESTs; Weaidysimfer to zinc finger prot 166 

322302 W76021 EST duster (not to UniGene) 166 

321891 AW157424 Hs.165954 ESTs 166 

300124 AB17394 fe242447 ESTs 165 

302747 AF062275 EST duster (not to UniGene) wffii exon h 165 

308741 AI802780 Hs-205002 ESTs; WeaWy similar to III! ALU SUBFAMJ 165 

310802 AB31546 Hs.159732 ESTs 165 

300694 AA0634C6 EST duster (not In UniGene) wffii exon h 2.65 

311395 R23313 EST duster (not to UniGene) 164 

336538 CH22J = GENESJ$40.J 164 

316473 AAB29S61 EST duster (not to UniGene) 164 

328134 CK06jBgjp868Q39 164 



ffl,M&A«DA453.LnCap 

PC3, HT29, CALUS 

M&MDA453, PRSC_con, NCW58 

PRSCjxn ffi, RPWE-2 

OVCA^PC3,M&*mM53 

NCm60,NC^H23,MCK358 

EB, OVCA-R, HT29 

BT474. NCRt23, M&WBA431 

BT474, PRSCJog, RPWE-2 

CALUS, NCW345, DU145 

BT474 f CALU6 I MCF7 

NO-K69, NOH345, PRSCJog 

OVCA*,DU145, UiCap 

DU145.M&WDM35S.PC3 

NCUM60.NO358.DU145 

A549.PC3.OVCA* 

OVCAK.MCF7.S 

PC3, OVCA-R. DU145 

BT474,A©MQA£31,ffi 

DU145,MCF7.MB*ffiA435s 

Caco2,ffi,DU145 

PC3,NOH520,BT474 

OVCA-R, MB-MDA-453, MCF7 

BT474.BT474.HT29 

NttH345.Caco2. LnCap 

NCt-H69, NCWJ45, MB-MDA-231 

NCI-H460, N©-MDA-435s, CALU6 

NCt^58 k NCM^460,Caoo2 

DU145,t3,CAUU6 

NCJ-H69, RPWE-2, PRSC.ccn 

EB.PC3, LnCap 

LnCap, KT29, MB-MDA-231 

M&MDA-453, MCF7, OVCA-R 

M&IUDA231. NOH345, BT474 

PRSCjeon, OVCA-R, EB 

OVCA-R, EB.A549 

EB, UCap, DU145 

DU145, EB, CAUJ6 

NC W69. NCW4345, PRSCJog 

ffi,A549,Caco2 

M&MDA-231, NCI-H345, BT474 

MCF7, MB-MDA-453, NC1-H69 

BT474,NO-H345, NCI-H69 

CALU6.DU145, OVCA-R 

NCW345. RPWE-2, PRSCJog 

UiCap,PC3,OVCArR 

DU145, EB,Nffi-NDA435s 

PRSCjeon, PRSCJog, RPWE-2 

DU145.DU145,NCW58 

OVCA-R, Caco2, EB 

RPWE-2. LnCap, PC3 

LnCap, NCttt23,NCW5S 

Caco2,OVCA*.HT29 

NCI4W60, Caco2, PRSCJog 

MB-MDA-435S, NCW520, NOW60 

DUU5, EB, CALU6 

PRSC.con, OVCA-R. PRSCJog 

LnCap, Caoo2,PC3 

BT474, OVCA-R. RPWE-2 

NCWH345,PRSC_con,EB 

NCW58. OVCA-R. Caco2 

0VCMIE5,MCF7 

EB,PC3,OVCA^ 

NCW23,Na+i520, NCW460 

DU145. OVCA-R, PC3 

EB,OVCA^,Caoo2 

PRSCjxxi, A549.HT29 

Nd4t23.BT474.MCF7 

PC3,EB,OVCArR 

Pf^Cjan, PRSCJog. NOH69 

BT474,EB I MCF7 

EB, OVCA-R, DU145 

DU145,NCWW60,NO358 

LnCap, OVCA^B 

LnCap, EB.CALU5 
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329330 CHJUogl|566880B 254 

316m AJ042101 EST duster (not in UniGene) 2.64 

328015 OL06J»gf|5902482 Z63 

308991 AI379831 EST singleton (not In UniGene) wffli exon 253 

323899 AL042966 EST duster (not In UniGene) 252 

321708 AA476817 EST duster (not In UitfGene) 162 

301752 T75247 EST duster (not In UniGene) wift exon h 252 

309351 AW057547 EST singleton (not in UniGene) wffli exon 252 

314412 AI864270 Hs.1 55654 ESTs 2.62 

309441 AW103055 Ha244230 EST 252 

335993 CH22J=GENES.656_6 251 

318196 AJ056776 Hs.133397 ESTs 25 

322880 AA310521 Hs50848 ESTs; WeaWy slmtorto KJAA0862 proteto 25 

300558 A640051 Hs.122638 ESTs 25 

318594 AA91B320 Hs224581 ESTs 25 

308554 AI698132 K&201923 EST 2.6 

335108 CH22LFGBCS.494J4 25 

312483 AM17526 Hs.1 84636 ESTs 2^9 

311981 AW452773 H&257612 EST 259 

319359 F13458 EST duster (not In UniGene) 259 

300230 AI377746 Hs. 158846 ESTs 259 

316504 AW135854 Hs.132458 ESTs Z59 

322337 AA249804 EST duster (not In UniGene) 259 

301775 AW247670 EST duster (not in UniGene) wflh exon h 259 

301089 AA666396 H&22J727 ESTs 258 

331213 T88698 Hs.163862 ESTs 258 

321121 W23285 EST duster (not In UniGene) 258 

316634 AW241910 Hs.122254 ESTs 258 

322141 ARJ75092 EST cluster (not In UniGene) 258 

312108 T82331 Hs.127453 ESTs 258 

339071 CH22J2A59H18.GENSCAM34-1 2.58 

311666 AW389509 H&223747 ESTs 257 

318662 AJ285698 Hs.1 15367 ESTs 257 

317010 AAB63395 EST duster (not In UniGene) 257 

324710 AI742028 Hs.1 20884 ESTs; Weakly similar to RAS-RHATED PHOT 

327888 CRC8Jisgi|5B68149 256 

336149 CH22LFGENES.706.5 2.56 

312816 H74319 Hs.188620 ESTs 256 

327999 Ca06Jtsgl|58o7994 256 

316761 AJ911173 H&213722 ESTs 255 

336958 CH22LFGENESw367-1 255 

325043 W27919 Hs32944 Inositol polyphosphato^-phosptetase; ty 255 

315417 AW452360 Hs.186770 ESTs 255 

3310)3 N78656 Hs.161535 EST 255 

309403 AW082954 EST singleton (not In UniGene) with exon 2.55 

337289 CH2?J=GENES.672-8 Z54 

314242 AB70943 H&246280 ESTs 254 

328053 CR06Jisgi|5902482 254 

307215 AI193189 EST singleton (not In UniGene) wffli exon 253 

327566 CH.03_hsgiJ5867811 253 

326338 CH.17JJsgi]6056311 253 

318115 AO84027 H&159130 ESTs; Moderately shiflar to UH ALU SUB 253 

307437 AC45683 EST singleton (not in UniGene) wift exon 252 

322059 AA412371 Hs.1 21 344 ESTs 252 

322505 AF147315 EST cluster (not in UniGene) 252 

314032 AWD81897 Hs.1 93211 ESTs 252 

336125 CH22J : GENES.701_12 251 

312765 AB929Q8 Hs.181873 ESTs 251 

335523 CH22J=GENES572J 251 

327585 CH.03J»giJ5867825 .251 

323183 AW393850 EST duster (not hi UniGene) 251 

314418 AI478722 Hs232275 ESTs; Moderately s&nBar to ill! ALU SUB 251 

313361 A1359782 Hs.1 37312 ESTs 25 

305632 AA805Z76 EST singleton (not in UniGene) with exon 25 

331689 W90131 Ha 184075 ESTs 25 

323438 AI540243 Hs.1 13817 ESTs 25 

315742 AI821724 Hs.143198 H sapiens PAC done DJ0872HJ7 from 7q31 25 

305971 AA886874 EST singleton (not In UniGene) ftffli exon 25 

336633 CH22J : GBIES.133 25 

304746 AA577793 EST singleton (not In UniGene) with exon 2.49 

327925 CK06Jisgi]5B68172 2.49 

336055 CH22J r GENES583_4 2.49 

328888 CH.07Jisgi]6588003 2.48 

311244 AW016694 Hs.197689 ESTs 2.48 

327155 CH.01_ns giJ58&7549 2.48 

334907 CH22_FGENES453^2 2.48 



EB, CALU6, DU145 

NO-H345, MB-MDA-231, PRSCJog 

BT474,HT29,M&MDA231 

BT474,ffl > NCW23 

DU145.A549.CALLI6 

EB.A549, CAUJ6 

HT29,BT474.NCI-H345 

NC W23, PRSCron, LnCap 

CALU6, A&M2A431, BT474 

BT474. MB-MDA-231, MB-MDA453 

NO-H460, NCW5B, NCI-H520 

EB.CAUU6.Hra 

DU145.A549.PC3 

OVCA-RNCLH69.MCF7 

PC3.MB-MDA453.DU145 

LnCap, EB.NO-H345 

NCW69. NOH345, M&MDA-231 

PC3, DU145.0VCA-R 

NC W460, MB-MDA453. NO-H23 

LnCap. NOH460, M&WDA-231 

HT29, NCW58.NCW345 

DU145.S.CALU6 

NCI-H69, NCW345, NOH345 

NCJ-H345, RPWE-2. PRSCJog 

PRSCJog, PRSC_con, RPWE-2 

DU145.EB.OVCA-R 

NCI-H69.MB-MDA-435s.PC3 

MCF7, HT29, BTT474 

PC3, OVCA-R, HT29 

A549.CALU6,Caco2 

CALU6, DU145, EB 

OVCArR, MB-MDA-231, BT474 

OVCA*.DU145,EB 

NCW520, PRSC_con, NCW58 

257 LnCap. DU145. MBMDA-453 

NOH345, MB-MDA-435S, RPWE-2 

NCI-H69.PC3.A549 

EB.Caco2,NCW460 

NCW58, NCm520, NO-H23 

NCLH345, NO-H460, MBWDAr231 

HT29, CALU6, CALU6 

NCI-H460.NCI4i23.HT29 

NO-H345. NCI-H69, PRSCjan 

NOH345. PRSC.con, PRSCJog 

BT474. MB-MDA-231, MCF7 

BT474, HT29. MB-MDA-231 

Caco2, MB4ADA-435S, MB-MDA453 

MB^DA231. DU145. MB-MDA-453 

HT29, CALU6, M^MDA-231 

NCW69, NC1-H520. NOH345 

PC3.A549.DU145 

DU145.EB.PC3 

NCI-H23, Nd-H520,NO358 

EB.DU145, OVCA-R 

PRSC.oon. RPWE-2. NCW69 

NCI-H345. LnCap, DLH45 

NOH69. LnCap. DU145 

NCW423, NCW58. NOH520 

HT29.BT474.OVCA* 

HT29. NOH460, MB-MDM53 

MB-MDA-231 , LnCap, RPWE-2 

EB.DU145.DU145 

CALU6, HT29, DU145 

MB-MDA-453, NOH460, NCI-H23 

NCI-H69.EB.A549 

NOH345, PRSC_con, MB-NDA-231 

MCF7. MBMDA453, M&M)A-435s 

NCW58.NCI+I23.NCW520 

NCWH69.NCW345. PRSCJog 

NCI-H69, BT474. MB-MDA-231 

NCI-358, NCW58, NC^H460 

EB, HT29, MB-MDAr231 

MB^DA435s, MB-MDA-453, PRSCJog 

NCW345.MCF7.PC3 

NC1-H69, MB-MDA-231, NCm345 

DU145. NOH345, MBM)/W31 
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314887 AA91Q236 Hs.139469 ESTs Z48 
339435 OffiLOB7SN16w(»«CAN.18-10 2.48 

334172 OC2JGBIE&3«L5 Z48 

320767 AA293525 EST duster (not in UmGerte) 148 

338772 CH23.FGBJESJ56-1 247 

326957 Cri21JisgS6469836 247 

308505 AIS86615 Hs200778 EST; Weak* staflar to SALIVARY PROLINE- Z47 

321325 AB033100 EST duster (not In UniGene) 2.47 

313149 AW291Q92 H&201058 ESTs 2.47 

3383^ CH22_BtACXH)5500i3Bi^m3077 Z46 

307877 A13SB880 EST stop^(nctftaUrdGene)wffl> exon 248 

311525 AI799444 Hs247095 ESTs; Moderately sMar to Oil ALU SUB 248 

337023 CH22_FGENES.433-12 Z46 

300916 AB61798 Hs.164675 ESTs 2.45 

302919 AL1373S2 EST duster (not In UnK3ene)wffliexonh 2.45 

320303 AL079289 Hs.137154 H sapiens mRNA full tengfo Insert cDNAc 245 

318359 AI097439 Hs.135548 ESTs 2.45 

314364 AA535840 Hs.162203 ESTs;WeaWysintotoafiefnafiveiysp Z45 

326763 CR2DjBg?6598307 Z45 

319900 AW408392 EST duster (not taUniGene) Z45 

314451 AA586368 Hs.190232 ESTs Z45 

300641 AW237699 Hs.118346 ESTs Z44 

324368 AW299374 EST duster (not In UnHSene) Z44 

336510 CH22J^3B£S*34_5 Z44 

326876 CR20jBgi|66a2507 Z44 

307753 AI340509 Hs.1 82426 ribosonaJ protein S2 244 

317071 M78728 Hs.1 32694 ESTs Z44 
313877 AA767869 Hs250113 ESTsiModoate^dmtertofliynAJhonn 

component TRAP150 [Rsaplens] 244 

315974 AWQ29203 Hs.191952 ESTs Z43 

322970 AI885052 Hs.142287 ESTs; WeaWysimBartolW ALU CLASS F 243 

317733 AI0282S7 Hs.132317 ESTs Z43 

313599 AA748749 Hs.136742 ESTs Z42 

323014 AA3Q5198 EST duster (not In UniGene) Z42 

324980 AA969121 Ka254296 ESTs 2.41 

301326 AA883831 Hs252924 ESTs Z41 

308695 AI763350 EST singleton (not si UniGene) with exon 241 

330166 CHXGLp2g56648220 Z41 

317552 AW451400 Hs.127019 ESTs 241 

320572 AI929508 Hs.159590 lymphocyte an^en 6 compfex;tocusH 241 
315618 AB87341 Hs.154029 ESTs; Weakly simBar to TRANSCRIPTION FAZ41 

331610 N91109 Hs54681 ESTs Z41 

311731 AW393528 Hs248875 ESTs Z41 

318571 Z43383 H&8053 ESTs Z4 

334958 CH22JGENES.465_27 Z4 

323570 AL038623 Hs208752 ESTs; WeaMy similar to HI! ALU SUBFAMJ Z4 

301685 W67730 EST duster (not In UniGene) wffli exon h Z4 

303849 AW163324 EST duster (not in UniGene) with exon h 24 

325702 CH14JegiJ5867028 Z4 

313074 N48261 Hs.127171 ESTs Z4 

308994 AI880051 EST singleton (noth UniGene) w8h exon Z4 

330338 CR08_p2giJ5457162 Z4 

327274 CK01jTsg^5867470 Z4 

325953 Cai6_hsg55867140 Z4 

333281 CH2?JGBffiS.128jr Z4 

314778 AW079559 Hs.152258 ESTs 239 

317C05 AI80G251 Hs.197773 ESTs Z38 

334257 CH2a/GBCS^67J5 Z38 

324783 AA640770 EST duster (not in UniGene) 238 

300949 AA534325 Hs.162183 ESTs Z38 

314957 AW029274 Hs208368 ESTs; Moderate^ similar to till ALU SUB 238 

324350 AW292501 Ks.157174 ESTs; WeaHy similar to similar to SH3-b 238 

338235 CH22_E\tACC0550aGENSCAN^60-16 238 

300937 AW297302 Hs255631 ESTs Z38 

317439 AW451327 Ks.170623 ESTs Z38 

324745 AI742120 Hs.116506 ESTs; WeaJdy strife to UII ALU SUBFAM1 Z38 

3383)6 »C?3fcAC005500.GENSCAN^02-2 238 

318765 Z42071 Hs23961 ESTs Z38 

310254 AB39811 Hs.157491 ESTs Z37 

3CS116 AA649244 EST stogbton (notfo UniGene) wfflj exon 237 

324016 AL04S285 Hs246849 ESTs; Moderately similar to OD ALU SUB 237 

322774 AA131111 EST duster (not In UnSSene) Z37 

335745 CH22JGBCSJ601J6 Z37 

300972 AI979100 H&211518 ESTs Z37 

338809 W223^C005SJ0.GBiSCAri531-10 Z37 

316983 AM8Q204 Hs.177131 ESTs Z37 



DU145.A549.A549 

NOH69.MCF7 f BT474 

NO+C69, NOH345, PRSCjog 

NO-358,NOH23,NCWW60 

Na358 l Na358 t NOti23 

BT474.RPWE4PRSCLcon 

MCF7, fvB-MDA-453, 

S t CALUS,A549 

NCWt545> PRSC.con, RPWE-2 

BT474.LnCap.EB 

NCW23, PRSCjog, NCW520 

PRSC_ccn, PRSCjog, NOK345 

OVCA-R, CALU6, PRSC.ccn 

UCap.DU145,CALU6 

LnCap, MfrMDA-231, CALU6 

BT474, MB-MDA-231, MB-MDA453 

NCJ+W60, NC-MQM53, NOH345 

OVCA-R, PC3, EB 

NOHS9.NOH345. RPWE-2 

Caco2, NO-H460, NCJ-H23 

PRSCjoon. NCW345, MB4DA231 

NCWC45. PRSCjog. PRSCjcn 

PC3.DU145, OVCA-R 

NCW69. RPWE-2, PRSC_con 

NOH23.NOH460.NCW520 

NCW23,NOH40).Caco2 

NCI~K345. NO-H69, RPWE-2 

DU145, LnCap, CALU6 

DU145, OVCA-R 
NCI-H345, RPWE-2, EB 
CALU6. RPWE-2, OVCA-R 
NCI4W60.Na358,NCm520 
PRSC.con, NCW460, RPWE-2 
MCF7. OVCA-R. PC3 
PRSCLcon, PRSCjog, RPWE-2 
RPWE-2 NO-H69.NCW345 
CALU6,DU145,A549 
NCk358.NCW58,NCkH23 
CALU6, HT29.A549 
OVCA-R, Caco2. I^MDA-231 
NOH23.NOH520, NO-358 
NCI-H69, NOH345, PRSC.con 
NO358,NCW23 ( NCU*520 
DU145, PRSC_oon, RPWE-2 
OVCA-R, EB, BT474 
MBAflWW31.NCW345.EB 
RPWE-2 PRSCjog, NOH345 
NO-H23, NOH460, NOU52Q 
M&M)A-231, RPWE-2. PRSCjog 
RPWE-2. EB, PRSCjoon 
DU145.EB, LnCap 
OVCA-R, DU145, MB-MDA-231 
MB-MDA-453.NB-MDA-435s.MCF7 
NCW23.HT29.DU145 
EB,GALU6,Caco2 
NB-MDA^31,BT474 ( HT29 
HT29.NCW58.MBM)A^31 
EB,OVCA^MWIHJA^53 
NCW«9, NCW345. PRSC Jog 
LnCap, DU145.DU145 
HT29.NCW23.NCW23 
NOH69,NC^H460, NCI-H23 
PRSCjog, PRSCjoon, PRSCjoon 
A549.DU145.EB 
NO358.NCm460,BT474 
NOH69. PRSC.con, PRSCjog 
LnCap, NCkC3,NCW520 
OVCA-R, DU145, EB 
CALU6, MBAEM35S, MWWDM53 
EB, DU145, OVCA-R 
OVCA-R, EB.A549 
PRSCjog, PRSC^con, NO-K69 
NCH«9, NOH345. PRSCjog 
NCWC3,NCW^9,NCW520 
NCW345. PRSCcon. PRSC log 
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321308 AI247480 Hs.1 17029 ESTs Z37 

323578 AA299492 Hs.168166 ESTs 237 

335747 CHZLFGBCS.601J0 236 

322% AF039697 EST duster (not In UniGene) 236 

314430 N76302 Hs.78110 ESTs; WeaWy similar to F17A92 [Ceiega 236 

304831 AA586422 EST singleton (not in UniGene) with exon 236 

337432 CH2£J=GENES.76&-1 236 

305384 AA887654 EST singteton (rat in UniGene) with exon 236 

313486 AW134523 Hs247186 ESTs 236 

309028 A1889109 rb212Q32 EST 236 

318292 AI679966 Hs.1 50603 ESTs 235 

334198 CH2£/GENES354_4 235 

314458 A1217440 Hs.143873 ESTs 235 

333346 CH22J=GENES.139J5 235 

325408 CK12Jisgip866921 235 

313758 AA076743 Hs.129770 ESTs 235 

309825 AW293701 EST singteton (not In UniGene) with exon 235 

303536 R55497 Hs.183941 ESTs; Moderate* simfer to H beta 58 ho 235 

331534 N51583 Hs.1 33756 EST 235 

325164 T16S81 Hs21963 ESTs 234 

327710 CHi)4Jisgi)5867860 234 

306351 AA961356 EST singteton (not in UniGene) with exon 234 

304968 AA614308 EST singteton (not In UniGene) with exon 234 

334015 CH22_FGENES313_7 134 

318315 AI091370 Hs.134852 ESTs 233 

306809 AIQ57134 EST singteton (not in UniGene) wfih exon 233 

337697 CH2^^M^CO00O97.GENSCAN.B6-1 233 

33630 CR11j2gip7290a) 233 

326577 Cai9Jogi|5867317 233 

333428 CH22_R3ENES.149 1 233 

301080 AM79391 Hs.1 55405 ESTs; WeaWysimfer to !U ALU SUBFAMJ 233 

324829 AA714311 EST duster (net h UniGene) 233 

302776 AJ133798 EST duster (not h UniGene) with exon h 232 

325801 CK14J»g^6552451 232 

332122 AA60S698 Ks.112389 ESTs 232 

314167 AA243633 Hs208983 ESTs 232 

324023 AA669615 Hs214226 ESTs 231 

320503 NM.00589 EST duster (not in UniGene) 231 

312217 T98289 EST duster (not in UniGene) 231 

321304 AA078293 EST duster (not In UniGene) 231 

323517 AA527359 Hs.154366 ESTs 231 

336455 CrC^GENES^tt 231 

313352 AW292127 Hs.144758 ESTs 231 

331457 H93135 Hs.41840 ESTs 231 

333054 CH2*JGENES.73_8 231 

308598 AI719237 EST singleton (rat In UnfGene) with exon 231 

327059 CH21Jegip531965 23 

334120 CH22J=GENES333_1 23 

324154 AM57449 Hs.1 92817 ESTs 23 

326509 CR19J» gj6682496 23 

316855 AW291384 Hs254974 ESTs 23 

337918 CH22_0AACOO55CD.GENSCAN.6fr4 23 

317471 A1825351 Ha 144084 ESTs 229 

331023 N32599 Hs^856 ESTs 229 

332231 M48008 Hs.1 02629 EST 229 

309912 AW339671 EST singteton (not &i UniGene) with exon 229 

316427 AC41019 Hs.145644 ESTs 229 

313329 AW293704 Hs.1 22658 ESTs 229 

335019 CH22J=GENES.474_7 229 

324394 F2D654 Hs.1 52128 ESTs; Moderately sirn3ar to Uil ALU SUB 2^9 

339357 CH22_BA354l12.GENSCAN.31-2 229 

322128 AI346033 EST duster (not in UniGene) 228 

301310 AC39457 Hs.130794 ESTs 228 

300623 AB29130 Hs.1 18261 ESTs; Moderately similar to finger prote 228 

323409 AL135534 EST duster (not in UniGene) 227 

308406 A1634885 EST singteton (not in UniGene) with exon 227 

322518 AI133446 EST duster (not in UniGene) 227 

338381 CH2^eAAC0Q55C0.GB^SCAN.33O-10 227 

316003 AA704584 Hs.1 19993 ESTs 227 

307090 AI161Q24 EST singteton (not in UniGene) with exon 227 

300356 AA758411 Hs.121335 ESTs 227 

331887 AA431328 Hs£8660 ESTs 227 

330951 H02566 Hs.191268 H sapiens mRNA; cDNA DKFZp434N174 (from 

305547 AA773111 EST singteton (not bi UniGene) with exon 227 

312457 AA776743 Hs.191589 ESTs 226 

333929 CH22JGENES300J 226 



BT474.NC»-H69,HT29 

UtCap,S,f^WDA453 

NCW69.LnCap,PRSCjx>n 

DU145, PRSC„con, NCW345 

DU145, MB-MDA453, CALU6 

NCI-H23, NCI-H460, CALU6 

M&MDA231.BT474.HT29 

DU145, HT29, CALU6 

DU145,A549,CALU5 

NCW58,M(m20,NaH23 

NC1-H460. Caco2, NCJ-H23 

NC W69. PRSCJog, PRSC.con 

Caco2, A549, PC3 

CALU6. DU145,LnCap 

NCW460, NCI-H520. NO-H23 

NCI-H23. mUDMSte, NCW345 

NCJ-B460, NCW23.NOH520 

DU145.CALU6.NOH520 

NOH23.NOH520.Na358 

NCI-H345, PRSCJog, NCW460 

BT474, MB-MDA-231, NCI-H345 

BT474, MB-MDA-231. M&M3A4353 

CALU6, HT29, M&MDA-453 

HT29,MBM)A531,BT474 

CAUJ6 I NCMH520,DU145 

PC3, EXJ145, EB 

RPWE-2, PRSCJog, NCW345 

NCW520, NCW23, NCW460 

NCW460, NCW58.NCW23 

NCW345. PRSCjoon, RPWE-2 

OVCAfi.MCF7.MCF7 

NCI-H460, NCW58, NCI-H23 

NCI-H23, NO-H460, NCI-H52D 

PRSCJog, MCF7, NO-H23 

DU145.HT29.PC3 

DU145.MCF7.PC3 

DU145.NCm345.EB 

A549, OVCA-R. PC3 

NCm23 r Caco2 f NC^69 

EXi145.OVCA-R.EB 

NCW345.DU145.EB 

NCI-H345, PRSC.con. RPWE-2 

MCF7. DU145. OVCAfi 

Caco2, NCUM60. NCW23 

NCWH69,NC!-358,NCkH23 

OVCA-R CALU6, Caco2 

NCI-H460, LuCap, LnCap 

NCW69, RPWE-2, MB-MDA435s 

NOH460. M&MDA453. NO-358 

NCI-H345.CAIU6.0VCAR 

NCI-H345. NOH460.BT474 

RPWE^. NO-H345, PRSCJog 

KT29, OVCAfi, DU145 

OVCAfi.LnCap.A549 

CALU8. DU145, EB 

N&WDA435S. PRSCjan. NCK358 

Caco2.HT29.ffi 

OVCA-R, DU1 45. Caco2 

HT29,CALU6,KBWDA-231 

NOH345. M&MDA-231, RPWE-2 

NO-H69, OVCAfi. BT474 

NC1-H23, NO-H520, NOH460 

OVCA-R. DU145. M^MDA-231 

BT474, RPWE-2, PRSC.con 

NCWC45, N0358,Caco2 

OVCAfi.EB.HT29 

DU145, MB-MDA-435S, OVCA-R 

NCI-H69, PRSCcon, PRSCJog 

NO358.Nf>H520.NC»i23 

NOK345, DU145, RPWE-2 

UiCap.NCm460.Caco2 

NCk358,NaH520.CALU6 

227 OVCA-R, BT474.BT474 

LnCap.DU145.BT474 

NCW345. RPWE-2, PRSCjoon 

Hra.CALU6.EB- 
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319845 AA549011 Ha 1879(2 ESTs Z26 

30S739 AIQ23393 ESTsfnsfefcn(radtnUniGei»)w&exDn 226 

306919 AKJ96832 ESTsfagJ^m(nottnUnGei»)wffliexnn 126 

333312 CH£_FBB€&13M 225 

334955 CH22_FGB£S.465_24 225 

3122% AA57B233 Hs.173863 ESTs 225 

307643 AB02124 EST sfcglelDn (not ki UnEene) with axon 125 

324252 AA421989 EST duster (not In UniGeite) 225 

309767 AW271805 ESTsfa^tetDn(nothlMK3ene)wffliexDn 225 

311492 AW41Q240 Hs.4437 ribosomal protein 128 125 

312260 H05392 Hs230597 EST 225 

327125 CH21JB gl[B531971 225 

316919 AA845382 Hs20452> ESTs 224 

316361 AI433833 Hs.164159 ESTs; Weakly slmQarto (ID ALU SUBFAMI 224 

315772 AW515373 Hs.158893 ESTs 224 

320236 H03688 EST duster (m* to UniGene) 224 

315444 AW138621 Hs221737 ESTs 224 

333903 CH22J=GENES294J 224 

335234 CH2*/GEMES515.3 224 

333727 CH2*_FGENES256J 223 

332002 AA482009 Hs.105104 ESTs 223 

329611 CH.«Lp2gi|3962478 223 

310559 AI783594 Hs. 15571 B ESTs 222 

327315 CHi)1J]sgi]5867508 222 

323170 U83527 EST duster (not in UniGene) 222 

331522 N49309 Hs. 117012 ESTs 222 

313261 AA730472 Hs.142805 ESTs 222 

312740 R97191 Hs.134106 ESTs 222 

325055 Z44631 Ks21658 ESTs 222 

337895 CH22^AAC005500.GB*SCA>i553 222 

307140 A1185762 EST singleton (not tn UniGene} with exon 222 

321643 W76005 I&32094 ESTs 221 

302663 X85153 EST duster (not in UniGene) w2h exon h 221 

322644 AA340904 EST duster (not in UniGene) 221 

330415 D83777 Hs.75137 WAA0193 gene product 221 

302334 AF120491 EST duster (not In UnlGene) wffli exon h 221 

326710 CR20Jtsgi)5867593 221 

323561 AA825426 Hs^38832 ESTs;WeaWysWlartoIfllALUSUBFAMI 221 

337706 CH2^*ACO0OQ97.GB4SCAN.87-11 221 

339309 CH223A354I1ZGENSCAR22-7 221 

330436 HG2724* Oncogene Tb/Chop, Fusion Acfivated 221 

312360 AQ22972 Hs.196073 ESTs 221 

301855 AF053356 muffiple UniGene matches 22 

331192 T55182 Hs, 152571 ESTs; Highly simBar to IGF-H mRNA-btod 22 

315872 AW051819 Hs204516 ESTs 22 

337904 CH223tAC0Q550OGBISCAN.56-17 22 

308258 A1565612 EST singleton (not In UniGene) wifii exon 22 

320965 H18166 EST duster (not In UniGene) 22 

333910 CH23J5BCS295J 22 

300707 AA080921 EST duster (not to Unpens) wffli exon h 22 

336011 CH22J=GB*ES.668_9 2.19 

325712 Cai4jBgq6682473 2.19 

322738 AF201832 EST duster (not In UniGene) 219 

335339 CH2Z_FGBsfES535J6 219 

320733 AA73&436 Hs.134407 ESTs 2.19 

319412 AA679426 Hs.187505 ESTs 219 

337132 CHZLFGBIE&52&4 219 

301544 AB51651 Ks224290 ESTs 219 

325235 Cai1Jsgip866903 218 

338280 CH223ftAC005500.GENSCAN290-1 1 218 

311421 AI701635 Hsl207077 ESTs 2.18 

330638 X89576 Hs.158581 matrix metelopratefcase 17 (mejnbrane4n 218 

326603 CH2Q_hsgp56312 218 

319055 AA412305 EST duster {not to UniGene) 218 

335451 CH22J=GENES^62J 218 

317989 AI203009 Hs.130864 ESTs 2.18 

322024 AA334384 EST duster (not In UniGene) 218 

300734 AW205197 Hs240551 ESTs Z18 

304022 T02990 EST stogteton (not hUnR3ene)wffli exon 218 

330082 CH.19j2gtj6015314 218 

312516 AA363245 Hs.189831 ESTs 218 

333932 CH22JGB*ES3Q0Jj 2.17 

308115 AW79071 EST stogteton (not in UniGene) wffli exon 217 

320184 U91510 Hs.123036 C039«ce1 2.17 

324432 AA464510 EST duster (not in UniGene) 217 

320382 AI832098 EST duster (not to UniGene) 217 



UiteftDU145,MCF7 
M&WDM35s t NCW58,CAUJ6 
HT29.BT474.PC3 
OVCAtf.DU145.PC3 
RPW&2. PRSC.con, NCWB45 
OVCA*.DU145, NCW345 
CALUB, CALUS, OVCA* 
OVCA-R.EB.A549 
DU145,NaH46Q,CALU6 
NCUi59, NCW460,NCUi520 
Caca2, m, DU145 
HT29.N0358.BT474 
NOH23, NO-K345, NO-H520 
DU145.EB.PC3 
OVC^R.S.UiCap 
N0358,DU145,NC14*23 
NCW5B. CALUS, PRSC_ccn 
MBWDA-231.BT474.A549 
NCWB9, PRSC.con, PRSCJog 
MB45DA231, NO-HQ, BT474 
ffi.NCm520.HT29 
BT474, HT29. MB-MQV231 
BT474, MCF7. MB-MQA-231 
NO-H69, EB, S 
ffi.DU145.UCap 
A549. LnCap, DU145 
OVCAfl.PC3.LnCap 
BT474,MCF7,OVCA* 
MB-MDA453, DU145, CALU6 
NOH345, PRSCJog, PRSCjcon 
NOH520, NCW460.EB 
ffi.N04t345.PRSC.ccn 

NOH460, NCW23, NO-H520 

CALU6.A549.Caco2 

NCm9.NO-H345.PC3 

NCW520,NCk358,NCWi23 

NCW345.DUM5.NOH69 

MBM}A435s. NCW58, NCW520 

BT474.HT29.PC3 

PRSCjcon, NO-H69, Caco2 

OVCArR. MB-MDA-435S, DU145 

NCW69.HT29,NO-H23 

OVCA-RPC3, CAUJ6 

UCap.OVCA-R.ffi 

OVCA-R.LnCap.ffi 

DU145,M&MDfc231.CALU6 

DU145.BB.LnCap 

DU145, MB-MDA-231, EB 

BT474.MCF7.HT29 

NCLH460.BT474.NCW345 

NCW460,NCkH23.NCU58 

PC3,RFW-2.PRSCj»n 

HT29. PRSCJog. MCF7 

DU145. BB, Caoo2 

NCUB45. PRSCJog. PRSCjcon 

NO-H59. NO-H345. PRSCjcon 

PRSCjan, MB4BA231, NO-H23 

PRSC.con. PRSCJog, h»AEAr231 

PC3, NCW58, HT29 

RPWE-2, NO-H345. NC1-358 

HT29.MB4fflA^35s,MB4«DA453 

CALU6.DU145.HT29 

A549.0VCMIW&MDA435S 

DU145,LnCap.CALU6 

NO-H345, NC1-H69, NCJ-H520 

Caco2,PC3,NCWfi20 

NCW58.A549.EB 

NOH23,Na358, NOH460 

NO-H23,Caco2,Caco2 

BT474,rn29 ( MB^B5V231 

PC3,Caco2,ffi 

BT474, OVCA-R. OVCA-R 

NCW520.NCW58.NOH23 

CALU6, RPWE-2, HT29 

OVCA-R. PC3, BT474 
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312251 HQ3952 EST duster (not In UniGene) 2.17 

315049 AW340486 Hs.121210 ESTs 2.17 
305016 AA627127 EST singteton (not In UniGene) wffii axon 2.17 

303807 AI792785 Hs.130434 ESTs Z16 

317792 AI553389 Ha.196121 ESTs 216 

321668 AA872730 Hs.125229 ESTs 2.16 

328863 CR07jTsgIj6381929 2.16 

319373 R00371 EST duster (not In UniGene) 2.16 

320069 T86541 Hs.189732 ESTs 2.16 

320235 AF064090 Hs.129708 tumor necrosis factor (EgamJ) superfeml 2.16 

338880 CH22_DJ32l10.GENSCAN.6-2 2.16 

318314 AI091349 Hs.161133 ESTs 2.16 

332696 D86973 Hs.75354 GCN1 (general control of arrtno-acid synt 2.16 

331352 AA406133 Hs.7482 WAA0682 gene product 2.16 

339019 CH22_DA5SH18.GENSCAN^M5 2.15 
306975 AI127042 EST singleton (not in UniGene) with excn 2.15 

318069 AKJ24557 Hs.131540 ESTs 2.15 

312997 AW205686 Hs.135130 ESTs 2.15 

331372 AA433935 H&55044 DKFZP586H2123 protein 2.15 

335049 CH22J=GB^ES.481_5 2.15 

324280 AA429772 Hs.191610 ESTs Z15 

330363 CHJU>2gp126882 2.15 

322896 AW470296 Hs.144830 ESTs Z15 

321981 AA948204 Hs.127361 ESTs 2.15 

333294 CH22_FGENES.130_6 2.14 

330170 CKQ2_p2gij6648220 2.14 

312973 AI123346 Hs/135241 ESTs 2.14 

311104 AIG27352 H&201449 ESTs 2.14 

325086 T10019 Hs.4194 ESTs 2.14 

317182 AW183524 Hs.192298 ESTs 2.14 

323644 AA310711 Hs.124340 ESTs 2.14 
308092 AM74896 EST shgleton (not In UniGene) wShexon 2.14 

322265 AF086244 EST cluster (not h UniGene) 2.14 
303521 AA746272 EST duster (not in UniGene) wBhexonh 2.14 

312102 AW439340 Hs.189720 ESTs 2.14 

316559 AI249468 Hs.228251 EST 2.14 
338486 CH23JMAC005mGENSCAN.3824 2.14 

301302 AI825444 Hs.210956 ESTs 2.14 

310591 AI650372 Hs. 195979 ESTs 2.14 

316231 AA732301 EST duster (not fa UniGene) Z14 

326559 CR19Jisg|5667310 2.14 

324062 AA525291 Hs304099 ESTs; Weakly similar to 111] ALU SUBFAMI 2.13 

323844 AI811303 Hs.143490 ESTs 2.13 

333895 CH2^_FGBJESJ293J2 2.13 

308264 AI567114 Hs.171454 EST 2.13 
300)81 AA908472 EST skeleton (rxrt in UnKSena) with exon Z13 

333101 CH22JK5BIES.79_6 2.13 

328544 CH.07JB g!|5868486 213 

333355 CH22J=GBJES.141_6 2.13 

323397 AJ524519 HsJ239699 ESTs 2.13 
305697 AA814956 EST singleton (not in UniGene) with exon 2.13 

327809 CH.05_hsgJ5867968 2.13 

325092 T10115 Hs.92423 ESTs 2.13 

322299 A1971935 Hs.252784 ESTs 2.13 

312145 AA029526 Hs. 126706 ESTs Z12 

323704 AA319421 Hs.193577 ESTs Z12 

328971 CH.08J*gl|5478806 Z12 

325338 CH.11Jisgij5866883 2.12 

331332 AA282554 H&89034 ESTs 2.12 

327159 CK01J»gip867550 Z12 

335180 CH22_FGBES.5Q5J2 Z12 

338062 CH2^JMAC005500.GENSCAN.162-3 Z12 

318350 AI636018 Hs.135538 ESTs Z12 

312070 AW293140 Hs.108790 ESTs Z11 

328314 CH.07Jisgip868371 Z11 

315869 AI033547 Hs.132826 ESTs Z11 

339246 CHZLBA354I1ZGENSCAN.S$ Z11 

329921 CR16_p2gi]6165205 Z11 

324981 Z25333 Hs.4947 ESTs Z11 

331291 AA159323 Ks.109929 ESTs Z11 

332729 AA058907 Hs.83190 fatty add synthase 211 

325448 CH.ttJ»gJl5866941 Z11 

314929 AW188286 Hs.143612 ESTs 21 

301063 AI057634 Hs. 124596 ESTs 21 

301952 AB029016 Hs.117333 WAA1093 protein 21 

326309 CH.17JisgI|5857277 21 



Nf>H4Q),NCW423 f NCW58 

NCm20,NCW58, NOH23 

M&MDA231, MB-MDA-453, EB 

NCW345. PR$C_con, PRSCJog 

NO«345,PRSC_coii < LnCap 

OVCA*,PC3,MCF7 

PRSCjcon, NCI-H345, NO-H460 

PRSC__con, RPWE-2, NCUK45 

NCW23,NCW58 i NC»4C45 

N(*H23,NCW460.NCI-H520 

BT474,MCF7,OVGfcR 

NO+C3, NO-H520, NO-H460 

A549.PC3.DU145 

PC3, EB, &B-MDA-231 

LnCap, EB, OVCA-R 

MB-MDA435S, NCl4i520, NCW58 

03002, Caco2, BT474 

NO^,NCWi23.NCW58 

PRSC_con,KT29 > CALU8 

NOH69. NOH345, PRSCJog 

MB-MDA453, MB-MDA4353, MCF7 

NCW23.NCW460.NCW58 

HT29,CALU6,EB 

A&MDA231.DU145.HT29 

EB, DU145, MB-MDA453 

HT29.MfrMDA453.PC3 

LnCap.DU145.EB 

NOH520,NO+i23,LnCap 

NOH460.NCI-H23.NCW58 

HT29.BT474.M&WDA435S 

RPWE-2, PRSC.con, PRSCJog 

BT474,MCF7,MB*!aA-231 

NCW345, RPWE-2, PRSC.con 

DU145, MB-MDA-453, EB 

NCWTO,NO-H460,MB^DA-435s 

NCW460.NCU58,NCWt23 

NCW520.NCI-H23.NCI-H69 

BT474,HT29,M*MDA-231 

CALU6,CALU6,Caco2 

NCki23,NO-H520,NCI-358 

DU145. NCWH460, NC1-H23 

OVCA-R, DU145.EB 

MB-MDA453. MCF7, MB-MDA-435S 

CALU6,UCap,DU145 

DU145.CALU6.MB-MDA453 

HT29, BT474. MB-MDA-231 

NOK345. NO-H69, PRSCJog 

NO-H23, NOH69. PRSCJog 

DU145, EB.CALU6 

EB.NOH460, NCi-H345 

NO-H520, NCI-H460, NCt-358 

HT29, PC3, OVCA-R 

KT29, NO-358, MB-MDA-231 

PRSCcon,DU145,DU145 

OVCA-R A549, MB-MDA4353 

Caoo2. LnCap, OVCA-R 

NO-358, NCW23,NCi-H520 

LnCap.NCW69,NCW345 

NCI-H520,NCI-H23 > Caco2 

EB.DU145.PC3 

LnCap,NCW69,A549 

PRSCjcon, PRSCJog, NCI-H69 

EB.KT29.DU145 

Caco2.NOH23.A549 

HT29.NOH23.NCWW60 

BT474,CALU6,MCF7 

CALU6, CALU6, BT474 

BT474,MBW^231,HT29 

A549.NOH460.NOH520 

NCKH345.A549, PRSCjcon 
NCW58,LnCap,MBM>A453 
DU145, MCF7.CALU8 
EB.BT474, MB-MDA-231 
NOH23,NO+W60,BT474 
OVCA-R, A549.GALU6 
MB-MDA-435S, NCI-H69, M&M)A453 
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315408 AJ323453 Hs.146625 ESTs Z1 
302376 AB0O7867 Ha200460 WAA0407 pnrfrin 2.1 
312181 AA417281 Hs.191595 ESTs Z1 
334254 Crfi*JGB*E&366_4 21 

318073 AW167087 Hs.131562 ESTs Z1 
304724 AA569881 H&65114 feuafintt Zt 
332359 W87704 H&211558 ESTs 2.1 
331884 AA4313Q2 Hs38721 EST; WeaW/stetotoNkoptofflsapte Z1 
3JM226 Ai559105 Hs.1 81 165 euteoyo&ctransbtaeto^tof^ 21 
324279 AA501412 Hs.191688 ESTs; WeaMy similar to Pro-PokJUTPase Z09 
337203 CH22J=GB*E&»1-3 209 

322346 AA227618 Hs.1 0882 HMGtac eonbHng protein 1 209 
304470 AA426654 Ks.195188 giycerakJehyoe^osphaie dehydrogenase Z09 
3S977 CH.16_hsgIJ62496Q2 Z09 

304696 AA554758 ESTsingto(natlnUrfGa«)wi8i©ton 209 

317412 AI301528 Hs.132604 ESTs 2j09 
315570 AI86O360 Hs.160316 ESTs 2.08 
327341 CHD1 _rsgij6017016 Z08 

327431 CK02Jsgip867754 208 

314685 AI370811 Ks.158709 ESTs; WeaWy sfmiarto KIAA0938 protein ZDS 
328624 CR07JB g?5868246 Z08 

303596 AW303377 EST cluster (not foUnX^)wfo exon h 208 

336717 CH2?JGBCa81-1 Z08 

317370 AW204139 Hs.1 74424 ESTs; WeaWy sfmiartD p140mDta [Mjihisc Z08 
" " — * Z08 

208 
208 
208 
208 
208 
208 
208 
208 
208 
208 
207 
207 
207 
207 
207 



331287 AA149061 Hs.172971 ESTs 
304211 N62228 ESTshg^on(notlnUnCen8)wahexon 
315613 AW137420 Hs.1 92311 ESTs 
325636 CH14Jisgij58670Q2 
338406 CH22_FGH€SJ23J21 
301714 F06529 EST diister (not h UniGene) wffii exon h 

300496 R45159 HsZ21804 ESTs 
318970 R21114 H&21383 ESTs 
334115 CH22JGB1ES-330J5 
303082 AM73682 EST singleton (not In UniGene) wffli exon 

308282 A1569456 EST singleton (not In UniGene) wiffi exon 

313038 AW451618 Hs.124195 ESTs 
317974 AW444468 Hs.144900 ESTs 
324063 AW292740 Hs254815 ESTs 
334759 CH22_FGBJES.428JB 
307864 A1367417 EST singleton (not in UniGene) with exon 



304356 AA196027 Hs.1 95188 gtyceraMeJiyae-3-phosphate dehydrogenase Z07 
303929 AW470753 EST smgtetcn (not In UniGene) with exon 207 

331857 AA421160 HsS456 SW1/SNF rented; matrix assocd; actin de Z07 
322814 AI824495 H&211038 ESTs Z06 
303650 AA430709 EST cluster (not In UniGene) wifo exon h 206 

333403 CH22_fGB€ai44JM 206 

313663 AI953261 Hs.169813 ESTs ~ Z06 

338594 CH22^UX0550aGB4SCAN.43&4 206 

334676 CH22_FGB4E&418_29 Z06 

310046 A1198032 Hs21Q356 ESTs Z06 
309169 AI949216 EST sfctfeton (not fa UniGene) with exon 206 

33752 CR14_p2gi|6065777 Z06 

325085 T10001 Hs.4188 ESTs Z06 
332062 AA521016 Hs.1 85375 ESTs Z06 
302074 AA382871 Hs.1 32794 phosphate cyfidyiyttranstefase 1;choIh Z06 
326344 CH.17J»gfl6525295 206 

330855 AA079318 zm98cZs1 S&atagene colon HT29 (#937221 

lMAGE5459543 , ,mRNAseq Z06 
302525 AF024690 Hs248056 G protetooupted receptor 43 205 
331903 AA436S73 HsJ29417 H sapiens nflNA; cDNA DKFZp586B0323 (from 
316322 AW296618 Hs.1 20637 ESTs 
321525 H78875 EST duster (not In UniGene) 

305071 AA640579 
326033 
334730 

305335 AA704235 
32)521 N31464 
333515 

311020 AI918672 
324323 AA393739 
305486 AA748889 
312162 TBI 823 
330980 H28794 

317463 AA9272S0 Hs.1 30462 ESTs 
303460 AA700155 Hs.117900 ESTs 
337435 CK22J=GB*ES.766-2 



HsZ4743 



EST singleton (not In UniGene) with exon 
Cai7_hsgJl5867178 
CH22J=GENES.424J5 
EST singleton (not In UniGene) with exon 
ESTs 

CH22JGBCai72_5 
Hs213783 ESTs 

EST duster (not In UniGene) 
EST singleton (not nUnQene)wffii exon 
EST duster (not in UniGene) 
Hs*659 ESTs 



205 
205 
205 
205 
205 
205 
205 
204 
204 
204 
204 
204 
204 
204 
204 
203 



OVCA-R DUU5.B 

OVCA-R, Caco2,HT29 

OVCA-R, A549.DU145 

U*^ OVCA-R, DU145 

A549,CALU6>EB 

NCW23, NCW620, NOH460 

MB-MDA435S. PRSC.con, NOH460 

NO+B45, HB-MQA-231, PRSCjcon 

EB.CALU6\OVCA* 

OVCA*. LnCap, PC3 

NOH69, NCW345, MBWDA231 

HT29,BT474.MB*ffiy«31 

NCUC3,CAU)6,NCW520 

NCW23,NCRfi20 I KT29 

MB^DA435s, NCW23, BT474 

Caco2,K,NCKJ58 

PRSCjcon, PRSCJog, NC W345 

M&M5A231, TOSCcon, NCJ-H69 

NCW23,N0358,NO+e2) 

MB-MDA453, MCF7, OVCAX 

MCF7,NCW58.RPWE^ 

RPWE-2. PRSC.con. PRSCJog 

0T474,HT29,MCF7 

NOH23, NCW460, NOH69 

OVCA-R ffi,NCtt045 

BT474,MCF7,M&MDA-231 

PRSCjcon, PRSCJog. PRSCJog 

NO-358, NOH460, MB-MDA-453 

HT29,K,DU145 

LnCap, PRSCJog, PRSCjcon 

PRSCjxn. LnCap, RPWE-2 

NCIU23, NOH520, NCI-H460 

BT474,NOH69,HT29 

M&WDA4353, NOH345. MB-M3A231 

LnCap, BJ, PRSCjcon 

NCLH345, PRSCjxn, LnCap 

NCU58,NCl-H23,Na-H520 

Caco2,NCU58,NCW520 

CAUJ6,HT29 t NOH520 

NCWM60,Na358,NCW23 

HT29, MCF7,MB-A/£)A-435s 

NOH345, PRSC.con, RPWE-2 

EB.A549.PC3 

PRSCjsn,RPWE4,Caco2 

RPWE-2. NO-H345, PRSC.con 

OVCA-R CALU6.PC3 

NOH345, OVCA^, NCm23 

DU145, LnCap, EB 

NOH69, PRSCJog. PRSCjcon 

MB-WDA435S, NCI-H23, Caco2 

CAUU6. EB, NC^358 

CALU6.HT29.DU145 

EB, OVCA-R. MB-MDM35S 

OVCA^MB^DA^53.MCf7 

LnCap, EB,NCW«9 

HT29,BT474.M&WnA453 

RPWE-2, LnCap, PRSC jam 

Na358,NOH23,DU145 

205 Caco2.DU145.A549 

BT474, M&MJA453, OVCA-R 

NCRJ23, PRSCjcon, NCI-H520 

MB4WDA-231.BT474.HT29 

HT29,DU145,BT474 

BT474,ffi,OVCA* 

MCF7, OVCAR M&MDA453 

MB^DA453,Affl-MDA-231,PC3 

NO-H345, RPWE-2. PRSCjXn 

NC^H46aNCWi23, NCi^i520 

OVCA-R, PC3, LnCap 

NCI-H345, PRSCJog, CALU6 

NCW520, NC^H23, 

MCF7, MB-ftfflA453, MB-MDA-435s 

NCW23 t Caco2,NCm69 

DU145,B,CALU6 

NOH345,OVCArR.UCap 
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305464 AA742425 


EST sfrgfefon (not to UniGene) with exon 


£03 


307918 AH83496 


EST singleton (not fn UniGene) wffli exon 


2.03 


322209 H89360 


EST duster (not in UnHSene) 


2.03 


310295 AW2G5198 Hs.149146 ESTs 


2.03 


325886 


. CH.16jBgIp867D87 


2.03 


329719 


CH.14jp2gi[6W5785 


2.03 


309247 A1972763 


EST singleton (not h UniGene) wffii exon 


2.03 


328277 


CH07JB 9(6004471 


2.03 


307296 AJ205705 


Hs.147222 EST 


2.03 


327203 


CR01J» gi|5867447 


2J03 


306866 AJ086683 


EST singleton (not In UniGene) with exon 


2.03 


333339 


CH22J=Ge<ES.139_8 


2.03 


323115 AJ921875 


EST duster (not h UniGene) 


2.03 


304811 AA584361 


EST sktgteton (not In UniGene) wtfo exon 


2.03 


323372 AL135125 


Hs.13913 ESTs 


2.02 


312854 AAB28713 


EST duster (not In UniGene) 


2.02 


307904 AO81019 


EST singleton (not in UniGene) wffli exon 


102 


332099 AA608983 


a^s1SoaresJes8sJ*KTHsaptenscDN 2J02 


324634 AI684571 


Hs.175831 ESTs 


2.02 


335721 


CHZLFGENES£99^4 


102 


312452 A1692643 


Hs.172749 ESTs ■ 


2.02 


325396 


CH.1^hsgi|5866921 


101 


328770 


CH.07_hsgJ)6017031 


2.01 


335585 


CH22JGB4ES581^4 


2.01 


335634 


CH22LFGENESx584J4 


2.01 


338271 


CH2^_BAAC005500.GB4SCAN287-1 


idi 


328607 


CHX7J»gi|5868233 


2.01 


307050 AI147341 


Hs.146734 EST 


101 


334946 


CH22LFGBES.465J3 


Z01 


319793 R56360 


EST duster (not tn UniGene) 


101 


307223 AI193698 


Hs.184776 ribosomal protein L23a 


101 


312627 AA344698 Hs.133169 ESTs 


101 


329221 


CHJUra 9^5868727 


101 


305145 AA653589 


EST slngiton (not in UniGene) wffli exon 


101 


328428 


CtMJ7Jisgi|5868417 


101 


305990 AA888866 Hs.125919 EST 


101 


319368 R00003 


Hs.133171 ESTs 


2 


324805 AA9270Q2 Hs.131350 ESTs 


2 


301138 AA719179 Hs.189419 ESTs 


2 


304675 AA541740 


EST singleton (not 6n UniGene) with exon 


2 


326194 


CR17_nsgf|5867213 


2 



CALUB i NCW520,NCW58 

NCW23,BT474,M&^.231 

DU145 ( 0VCARM&MDM53 

NCRt23 l NaH460 l NO358 

NCHI345.NCHtt45,RPWE4 

NC&H59, RPWE-2, PRSC.con 

UiC^PRSC.ODn, RPWE-2 

LnCap, RPWE-2, A549 

NCVH460,NC^358 I NCW23 

HT29 f BT474,MBWDA-231 

BT474.NCW345.HT29 

KT29, DU145, CALU6 

BT474,BT474.M&-MDA-231 

NCW23 l NCW58 1 NCWi460 

DU145, EB.A549 

NOH345, PRSC.oon, PRSCJog 

KT29, MCF7. MB-MDA-453 

PRSC jan, NOH345, RPWE-2 

NCI-H460, Cacol NCf-358 

KCM469, PRSCJog, 

H729.Caco2 > M&WDA-231 

HT29.NCW520,NaH460 

NCRC3,NaH460,NO358 

W»JADM53,DU145,MCF7 

NCI-H23, NCW460.NOH69 

MCF7.DU145.PC3 

NCW4450, NCRt23, U0358 

NCW520,NCWi23.NO358 

CALU6.BT474.DU145 

NCUU60,HT29,Na358 

NCM58,NCW520^NCW23 

PC3,LnCap»MB-hm331 

NOH345,NCW69.N0358 

LnCap.S.OVCA* 

NCRJ69, M&WDA453, BT474 

NCVH520,NCW58 f NOH23 

OVCA*,LnCap,PC3 

NaH460,NCW23,NCW58 

NC^H69 f NCW423,PRSC„con 

NCW460, NCW520, M&MDA-231 

HT29,N0358,BT474 
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Table 5: H chip - B survivor vs Met query - up in Mets 



Pfccyi Up&pc Eos probesct identifier uumbo 

ExAcck Exemplar Accession number, Gccbank secession number 

Um'genc Title: Unigcoc gcoc titk 



Ptey ExAccn UniGJD Compteta_Tffle Ratio HetfB surv. 

102193 U20758 H&313 secreted phosphoproteln 1 (osteaponfin; 5.56 

12&530 AA504343 Hs.183475 Homo sapiens done 25CS1 mRNA sequence AJSZ 

129093 AA262710 Hs.108614 WAA0627 protein 423 

124690 R05818 H&173830 ESTs 196 

115558 AA 393806 Hs.1010 rec^^oCn^spirafeassenibfyl 339 

134261 AA227678 H&8Q84 Hunn DMA sequence from done 465N24 on c3.22 

104792 AA029288 Hs^9147 ESTs; Highly simfer to ZINC RNGB* PROT 117 

133770 M69197 Hs^42279 haptogtocto-reJated protein 107 



131 



WO 02/068677 



PCT/US02/06001 



Table 6: H chip - B survivor vs Met query - down in Mets 



Pkcy: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Gcrihank accesrioa nmriber 

UnigcnelD: Unigenc number 

Unigcae Tide: Unigcuc gap title 



Ptey ExAccn UnlOJD Completejftle Ratio Met/Bsurv. 

100116 D00S54 Hs.77443 acfin; gamma 2; smoom muscle; enteric 0.07 

101923 S75256 HNl^neutrophi DpocaBn [human, ovarian 02 

129982 M877B9 Hs.140 krwnunoglocijlln gamma 3 (Gm maiter) 02 

130064 T67053 Hs.181125 Inrnwnogfobulin lambda gene cluster 02 
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Table 7: I chip - B survivor vs Met query - up in Mets 



ExAccxk Exemplar Accession manner, Genbank acccsson number 

U iiigcn cID: Umgcnc cumber 

Umgeae Tifte: Vagen gene tttb 



Pkey Exjtacn UnlGJD TTtte Ratio Hetffl surv 

319379 T91443 H3.193SS3 ESTs ia 65 

321920 K63915 n 5 

324302 AA543C03 Hs.136806 ESTs; WeaJdy sinter to DU ALU SUBFAM1 931 

314522 AJ732331 Hs.1 87750 ESTs; MaterateJy sfmfer b> Ull ALU CtA 5.79 

331433 H68097 Hs,161023 EST 479 

324643 AJ43835S Hs.130729 ESTs 439 

332471 AA416967 Hs.1 20980 nudear receptor CMeprcssor 2 458 

314915 AA573072 Hs. 187748 ESTs; WeaWy similar to M ALU SUBFAM1 43 

321354 AA078493 EST cluster (not hUniGene) 426 

322309 AF086372 ESTctetarfnotiiUnjGene) 339 

325100 T1Q265 Ks.116122 ESTs; WeaMysMar to coded for by C. 331 

314071 AA192455 Hs.188690 ESTs 3.74 

315178 AW362945 Hs. 162459 ESTs 3.66 

330987 H40988 Hs.1 31 965 ESTs; WeaHy similar to 101 ALU SUBFAM 331 

337898 CH2*_BMC0QamGENSCAfl3o%5 321 

319403 T98413 EST cluster (not hUniGene) 12 

331469 N22273 Hs39140 ESTs 3.15 

331549 N56866 H&237507 EST 3.14 

331644 T99544 Ks.173734 ESTs; WeaWy simflar to HI! ALU CLASS B 3.14 

313220 AJ971981 Hs.118241 ESTs 3D4 
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Table 8: I chip - B survivor vs Met query - down in Mets 



Plocyr Unujtic ffflfl pidbeset identifier number 

BxAccos Bxeniplflr Acee srfoo nnnibcTp Ocofamfc ftcctsrion nmober 

UcigeDcIDt Unigene number 

Unigene Tide: Unigene gene title 



Pkay ExJVccn UniGJD 


Titte 


Batfn Mat/R eitru 
KoUO WmUD SUlV 


333658 


CH22_FGENES.241J 


U.UO 


333657 


CH22JGENE&241.2 


0JJ7 


333654 


CHZLFGBE&240J 


0.07 


332859 


CH22JGENES.27J2 


0.07 


333656 




O07 


304480 M430373 


ESTsingteton{notInUniGene)wfthexon 


0.08 


333737 


CH2U=GENES^61J 


O09 


3080)1 AI719930 


EST singleton (not in UnK3ene)wfth exon 


01 


334030 


CH2?_FGENES320J 


01 


333637 


CH2?J : GENESJ229J 


013 


302347 AF039400 Hs. 194659 chloride channel calcium activated; fam 


016 


333653 


CH22JrGBtESJ239J. 


016 


333635 


CH22J=GBJES.228_2 


019 


333647 


CHZLFGENES.235 J. 


019 


307588 A1285535 


EST singteton (net In UniGene) with exon 


0.2 


337954 


CH2?_EMtAC005500.GENSCAN^3 


02 


333588 


CH22J=GENE&206_2 


021 


320244 AA296922 Hs.129778 gastroWesflnaJ peptide 


022 


333642 


CH22J=GENE&231_2 


023 


337951 


CH2^MAC005500.GENSCAN^4-1 


023 


333730 


CH2£J=GENE&258J 


023 


333646 


CH2^FGENEa234J 


024 
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Table 9: H chip - B survivor vs Met query - up in Mets 



Es^Acco* £^Gcxnpfaff Accession unoifac^ Gcufasiilc secession tsuzobcf 



Ptey ExAccn UniGID CcrapteteJBie RtedbnBrfsAIvsBedbnBSurAI 

100655 HG2841-HT2970 Albumin, Aft. Spflce 5 1138 

124875 R70506 Hs207693 EST* Weakly sJmflar to UD ALU SUBFAM1 021 

102193 U2D758 Hs313 secreted phosphupiukdn 1 (cstecponfin; &73 

100654 HG2841-HT2969 Albumin, Aft. Splice 3, tfsspfcmg In AflDaQjuirin Venezia 6.18 

118828 K79496 Hs.50824 EST 533 

128046 AA873285 Hs.137947 ESTs 54 

128898 D14446 Hs.107 fibrinogen*! 5.17 

127917 AA211895 Hs.1 18831 EST;HgWyslmDartodJ1163J1i1 pis 5.11 

125090 T91518 ye20S5^1 Sfcrtagene tag (#937210) Horn 4.47 

118579 N68905 sn^h4jcftbcytDto»A5(RAMTES) 423 

123526 AA608657 ESTs; Moderately sfeitortolfll ALU SUB 421 

128062 AA379500 Hs.193155 ESTs 4.14 

119174 R71234 yi54cQ&s1 Scares placenta Nb2rP Homo sa 4.11 

128530 AA504343 Hs.183475 Homo saptens done 25061 mRNA sequence 4D9 

119404 T92950 ye27c10.s1 Strategene lung (§937210) Horn 338 

118475 N66845 Hs.1 65411 ESTs; WeaWysiraflartollfl ALU CLASS B 336 

129974 KD0629 Hs.1 99300 Humankpni repeat mma(coteckmepcd-lc 187 

108888 AA135606 Hs.189384 ESTs; WeaWy sirrfe- to IIIl ALU SUBFAM 335 

123963 C13961 H&210115 EST 33 

123523 AA608588 Hs.193634 ESTs 3.76 

128230 AA984074 Hs.1 76757 ESTs 3.75 

124090 H09570 Hs.143032 ESTs; WeaHy sJmHar to neuronal fluead 3.67 

124690 R05818 Hs.173830 ESTs 338 

134261 AA227678 Hs3084 Human DNAseqim»fromckjne465N24 on 337 

126917 AA176225 Hs.1 93929 ESTs > 332 

126050 H27267 Hs.75860 hydraxyacyl-Coenzyme A dehydrogenase/3-k 3.45 

126649 AA856990 Hs.125058 ESTs 3.42 

115096 AA255991 Hs.175319 ESTs 3.4 

129906 H39216 H&239970 ESTs;WeaWysbnfarto2NF91LtH^apien 338 

123022 AA480909 aa28f10.s1 NCLCGAPJ3CB1 Homo sapiens cD 338 

106145 AA424791 Hs3734 WAA0679 protein 338 

125191 W67257 Hs.1 38071 ESTs; WeaWy similar to D9 ALU CLASS B 336 

108836 AA132061 H&222727 ESTs;WeaWys^touWquBousTPRm 33 

13710 J04813 Hs/104117 cytochrome P450; subfaniiy DIA (niphecfi 327 

123460 AA598981 H&251122 EST 325 

133735 AC002D45 Hs251928 nudear pore complex Irrteraefing protein 324 

124696 R06273 Ha 186467 ESTs; Moderately srmfer to IB! ALU SUB 324 

120748 AA303153 Hs237994 EST; Weakry stmSarto IIQ ALU SUBFAMIL 321 

133770 M69197 Hs242279 haptoglobiwebted pratefri 3.17 

128336 AE42720 Ks.146043 ESTs;WeaidysrmibrtoaAemafivelysp 3.14 

135357 AA235803 Hs.79572 catnepsfn D (lysosomal aspartyl protease 3.12 

128088 R02443 Hs.186467 ESTs; Moderatefy dmOarto RH ALU SUB 338 

124055 F10904 Hs.100516 Homo sapiens done 23605 mRNA sequence 336 

124896 R82063 Hs.101594 EST 336 

127598 AA610677 Hs.168851 ESTs 334 

116802 H44061 Hs.194026 ESTs 331 
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Table 10: H chip - B survivor vs Met query - Down in Mets 



Pkcy: Unique Eos probestt identifier puinbcr 

ExAceo* Excxnptar Acccssioo mmbcf) Ocrib BDlc fli ccssfoo nrnnfaer 

UnigcDelD: Unigene number 

Unigenc Tide: Umgene gene title 



Pkey ExAccn UniQJD Comptete„Tffle Ratio Met/B sure. 

100116 D0C654 Hs.77443 acfin;gamma2;smooft muscle; enteric 0.09 

130064 767053 Hs.181125 InimunogtobuSn lambda gene duster 0.11 

129982 M877B9 Hs.140 ImmunogtoUifin gamma 3 (Gm maricer) 0.12 

131219 C00476 Hs24395 smal fcrxfudbte cytokine subfamty B (Cy 0.13 

133806 M12759 Hs.76325 Human Ig J chain gene 0.17 

132982 LQ2326 Hs.198118 immunoglobulin bmbda-lika pdypep&Je 2 0.18 

131713 X57809 Hs.181125 Iminunogtobidbi lambda gene duster 0.18 

131791 S71043 H&32225 Immunoglobulin alpha 1 02 

133725 V00563 Ks, 179543 bnmunogtobufin mu 022 

101923 S75256 HNL=neutrophi] fipccalln [human, ovarian 023 

101461 M22430 Hs.76422 phosphoGpase A2; graup IIA (platefets; 024 

103448 XS9133 H&204238 Bpocalin 2 (oncogene 24p3) ._ 024 
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Table 11: H chip - Met vs Normal query - up in Mets 



PJccyi Ujdjjbc Eos probcsct identifier Dumber 
ExAoco; Exemplar Acccsstosimnba; Gcsbanka 

t&rigene TTflftr Uxngcns gene title 

Pkey ExAccn UniGJ) CompJrfBjTSiB Median Mete AJvsHedon Kom^Al 

100655 HG2841-HT2970 Albumin, AB.Spfce 5 1531 

102133 U20758 Hs313 secreted phosphaprntein 1 (ostecpenfin; 083 

124875 R70506 K&207693 ESTs; Weakly sWIar to DJ1 ALU SUBFAM1 658 

100654 HG2841-H729© AlhirohAaSp!]ce3,Misspfi^ 523 

124059 F13673 H&93768 ESTs &11 

128896 D14446 Hs.107 fiMncgen-Gke 1 5X5 

134453 X70683 H&83484 SRY (sex <teto**g region Y>tox 4 432 

131564 AA491465 H&2&792 ESTs 4.78 

127917 AA211895 Hs.118831 EST;HigWysimaartodJ1163J1i1 [H.s 4.76 

115096 AA255991 Hs.1 75319 ESTs 4.67 

104558 R56678 Hs58959 Human DNA sequence to done 967N21 on 4.63 

123526 AA608657 ESTs; Moderately state to Oil ALU SUB 4.61 

125090 T91518 ye20J05LSl Stratagem lung (S937210) Horn 458 

1S666 M77349 Hs.1 18787 transfoimhggrowft factor, bea-Muced 458 

118828 N79496 Hs50324 EST 456 

128046 AA673285 Hs.137947 ESTs 4.45 

133421 AA436560 HsJ327 daixfinl 4j09 

129158 J05257 Ha.109 dlpepfidase 1 (renal) 4.04 

128062 AA379500 Hs.193155 ESTs 4.03 

124696 R06273 Hs.186467 ESTs; Moderately simiar to Oil ALU SUB 4.01 

118475 N66845 Hs.1 65411 ESTs; Weakly similar to flfl ALU CLASS B 336 

104755 AAQ24482 Hs3Q29 DKFZP434G032 proteii 333 

104978 AA088458 Hs.19322 ESTs 3.74 

118579 N68905 small todudbte cytokine A5 (RANTES) 3.7 

1237% AA620380 H&247444 ESTs 352 

127240 AA888387 Hs243845 ESTs; Moderately similar to !I11 ALU SUB 351 

104105 AA422123 Hs.42457 ESTs 355 

12S349 D86974 Hs.1 10613 KIAA0220 protein 354 

119329 171832 ESTs; Moderately similar to till ALU SUB 353 

114617 AA084148 Hs.110659 ESTs 352 

123143 AA487595 aa95eZs1 Stratagene fetal retina 93722 3.48 

103119 X63629 Hs2877 cadherin 3; P^adherln (pfecenbJ) 148 

119404 T92950 ye27d0.s1 Stratagene lung (#937210) Horn 347 

123963 C13961 Hs210115 EST 347 

116480 C14088 Hs.195188 cJyceraioerryde^ 3.4 

108836 AA132061 Hs222727 ESTs; WeaWy similar to ubiquitous TPR m 339 

120748 AA303153 Hs237S94 EST; WeaMy simfiarto Utl ALU SUBFAMfl. 138 

133770 M69197 Hs242279 haptcgtabirHelated protein 33 

132358 X60486 Hs.46423 K4histonefamDy; member G 337 

127759 A1369384 aryfeufetaseD 337 

129095 L12350 Hs.108623 fluembospondin 2 337 

128261 AKJ61213 Hs.13179 ESTs; Moderately simBarto UU ALU SUB 336 

126908 AA16986S ESTs; Weakly similar to UU ALU SUBFAM1 33 

128954 N32118 K&209100 DKFZP434C171 protein 334 

119174 R71234 y&4dfta1 Soaies placenta Homo sa 333 

106687 AA463234 Hs.1 19387 WAA0792 gene product 332 

128230 AA984074 Hs.176757 ESTs 33 

126649 AA856990 Hs.125058 ESTs 125 

124620 N74051 Ha.194092 ESTs; WeaWy similar to ID] ALU SUBFAM 124 

135427 AFFXcontrot human afu repeats 323 

129967 H99653 Ks.138618 ESTs 322 

125191 W67257 Hs.138871 ESTs; WeaWy similar to Oil ALU CLASS B 12 

124684 R02401 Hs221078 ESTs 12 

128010 AA856953 H323348 S-pnase Hrase^ssocbfed protein 2 (p45 117 

119423 T99544 Hs.173734 ESTs; WeaUysfmSar to Oil ALU CLASS B 116 

123022 AA4809 09 aa28f10.s1 NCt_CGAP_GC81 Homo sapiens cD 3.15 

103654 Z70759 H^a{^maodtc«lrial16SrRNAgen8(p 113 

128336 AE42720 Hs.146043 ESTs; Weakly similar to alternatively sp 112 

124690 R05818 Hs. 173830 ESTs 11 

129791 F02778 Hs.173887 WAA0876 protein 107 

114472 AA028924 Hs.177407 ESTs; WeaJdy s&riar to Dfl ALU SUBFAMI 107 

115429 AA284139 Ks59295 EST 106 

130020 AA433930 H&240443 ESTs; Weakly state to HNK-1 suMans 106 

126050 H27267 Hs.75860 hyotayacyWfcen^ 
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129906 H39216 H&239970 ESTs; WeaHy sinter to ZNF91L [Rsapien 3.04 

123422 AA598484 H&238476 EST 3.03 

103059 X57351 Hs.174195 Werfercn Induced tranaiiBinbianepfDtBki ZM 

124253 H69742 Hs.102201 ESTs 102 

123523 AA608588 Hs.193634 ESTs 3D2 

132669 AA188378 Hst546Q2 ESTs; Weakly similar to 60S RIBOSOMAL PR1Q2 

123186 AA4S9250 Hs£9403 serine palmitoyftosferase; stibunit II 101 

122948 AA477483 zu44hZs1 Scares ovaiy tumor NbHOT Homo 3.01 

119053 R11501 yf28f1^1 Scares fetoltiverspteenlNFL m 

125S53 H40829 yo05d1in Scares adult brain N2)5HBffiY 3 

119155 R61715 Hs.138237 ESTs 3 
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Table 12: H chip - Met vs Normal query - down in Mets 



Vkcyi Unique Eos probeset tduilifia Dumber 

ExAcdE By^wy br TTTTTritCT, GffPblDk UrCTJajoil gafflbtT 

Umgcne Title: Thrigmt gene title 



Pkey ExAccn UniGJD Completejffia Uss&m Mete AJ vs Hedbn Normal AI 

034® Y0Q339 Ks.155097 catamfc aimydrase II Q.01 

104258 AF007216 H&5462 sotutecanfer{an%4;sodIurabicaiixHi 002 

08399 AA156084 Hs.72115 ESTs 004 

01046 mm Accession not fisted foGenbank O04 

33565 H57056 H&204831 ESTs O05 

01346 L76465 Hs.77348 hyteyprosiaglarefin dehydrogenase 15<N QS& 

123137 AA487468 Hs.100686 ESTs; WeaMysmi^ to secreted cement OSS 

I34534 X73501 Hs34905 K Sapiens mRNAfercytotefafin 20 0.05 

18823 M79237 Rs-50813 ESTs; Weakly sWfar to long chain My OQS 

02095 U11313 Hs.75760 sterol earner proteki 2 006 

11855 R37362 Hs31351 ESTs O06 

13105 AA224351 Hs.108681 ESTs 007 

30320 U19495 H&237358 stromal celWerived factor 1 O07 

13778 W15263 H&5422 ESTs 0.07 

16788 H25836 H&83429 tumor necrosa factor (Bgarri) superfami O07 

00116 D00654 Hs.77443 acfin; gamro 2; smooth musde; enfefic 0.07 

04636 AA004415 Hs.106108 ESTs 0.07 

07032 AA599472 r&247309 succwate^ T^ise; GDP-famrag; befa O08 

106605 AA457718 H&211Q3 Homo sapiens mRNA; cONA DKFZp564B076 (fr 0.08 

128908 AA487557 Hs.10706 ESTs 0.08 

130016 AA055811 Hs.143131 transmembrane glycoprotein 0.08 

'13523 T90037 Hs.16686 ESTs 008 

02638 U67319 Hs3216 . caspase 7; apoptos&related cysteine pr O09 

124308 H93575 Hs*227146 rk>mo sapiens mRNA; cDNADKFZp554J142^ 0.09 

29519 AA29878S Hs.112242 ESTs 0.09 

34749 L10955 Hsj89485 carbemtoanhydrase IV O09 

30366 L11708 Hs.155109 hydraxysterokl (17-beia) dehydrogenase 2 0.09 

09272 AA195718 H&86030 ESTs O09 

02124 U14528 H&29981 solute carrier family 26 (sufatetransp 01 

132711 N73702 H&238327 ESTs 01 

131861 D11925 Hs.184245 WAA0929 prote* Msx2 Intaracfing midea 01 

133808 M12759 Hs.76325 Human Ig J chain gene 01 

02571 060115 Homo sapiens sfcefeteJ muscle UM-pro&h 01 

14846 AA234929 Hs.44343 ESTs 011 

131328 V01 512 H&25647 v^FW murine osteosarcoma viral onco 011 

06569 AA455983 Hs.117816 sorch 011 

03542 Z11793 H&3314 selenoprotein P; plasma; 1 011 

128915 C02385 Hs.107138 ESTs 011 

2)914 AA377254 Hs^7107 EST 011 

30867 J04093 Hs2056 UDPglycosyfoansferase 1 011 

10837 N30796 Hs.17424 ESTs; Weakly sirn3ar to semaphortn F [H. 012 

01877 M97496 Hs.778 guanytate cyclase activator 1B (retina) 0.12 

132617 AA171913 Hs-5338 carbmfeanhydraseXIl 012 

129113 AA147646 Hs.108740 DKFZP586A0522 protein 0.12 

133435 T23983 H&7365 ESTs 013 

328% F09557 H&57929 sa(Drosopr£a)rromotog3 013 

25832 AA6286Q0 Hs.1 17587 ESTs 013 

04613 AA001049 H&24713 Homo sapiens mRNA; cDNA DKFZp36G0123 (f 013 

32903 AA235404 Hs£985 Homo saptens clone 25186 mRNA sequence 013 

19479 W32Q94 Hi55501 ESTs 0.14 

31273 AA421139 Hs.173542 ESTs 014 

06674 AA4613G3 HsJ946 DKFZP586D1519 protean 0.14 

08980 AA151676 Hs33455 pep8dyl ajgime deirmnase; type II 014 

03211 X73079 H&205126 prfymerfcimmuno^obu^ 014 

31219 C00475 Hs^4335 smaS induefcte cytokine subfamily B (Cy 015 

16459 AA621399 Hs£4193 ESTs 015 

130219 R77539 Hs.15285 ESTs 015 

13863 W68388 HsJ?1288 ESTs; Weakly slmfer to WAA0704 protein 015 

01564 M32886 Hs.117816 sorrfn 015 

\0BSC2 AA233837 Ha.44755 ESTs; WeaWysimBartornen^Tariag^ccfr 015 

107222 D51235 H&82689 tumor re]ecfion anflgen (gp96) 1 015 

135237 AA454930 HslS691 ESTs 015 
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112483 R66534 Hs.28403 ESTs . 0.15 

132387 R70914 Hs.8997 heat shoe* 70kD protein 1 ai5 

130343 AA490262 Hs.15485 EST* Weatfy sMar to APICAL-LIKE PROT 0.16 

105496 AA256323 Hs25264 DKFZP434N1 26 protein 0.16 

104037 AA372630 Hs.100347 dfflereniialy expressed mhematopotefi 0.16 

101461 M22430 Hs.76422 phosphatase A2; group UA{pbtelete; 0.16 

116551 D20458 Hs229071 EST 0.16 

133889 AA099391 Hs211582 myosin; Dght polypeptide kinase 0.16 

103653 Z70295 Hs.32966 guanytetec7dase3dhrat^2B (uroguany ai6 

101070 L02785 Hs.1650 down-regulated In adenoma 0.17 

131501 AA121127 Hs.181307 H3 history famBy3A 0.17 

133515 X98311 Hs.74466 cardnoemcryonJc antigerweJated cell ad 0.17 

108604 AA039820 Hs.49696 ESTs 0.17 

132982 L02325 Hs.198118 toimmogbbulin lambda-fee polypeptide 2 017 

131676 C207B5 Hs^0514 ESTs 0.17 

134675 AA250745 HsJ7773 protein kinase; cAMP-dependent; catafyti 0.17 

133441 M82S62 Hs.179704 mecrtn A; afcha (PABA peptide hydrolase} 0.18 

130455 X17059 Hs.155956 N^cetyttransfeiase 1 (aryiamine Nicety 0.18 

131734 D62965 HsJ31297 ESTs 0.18 

100749 HG3521-HT3715 Ras-RelatedProtetoRap1b 0.18 

116724 F13665 Hs.65641 ESTs 0.18 

129265 X58277 Hs.171695 dual specfficfty phosphatase 1 0.18 

102347 U37518 HS.B3429 tumor necrosis factor (Ugand) superfami 0.18 

114542 AA055768 Hs.122576 ESTs 0.18 

123900 AA621223 Hs. 11 2953 EST 0.19 

121780 AA422088 Hs.124660 ESTs 0.19 

115662 AA405715 H&64179 hypoftetical protein 0.19 

113803 W42789 H&31446 ESTs 0.19 

105493 AA256268 Hs.10283 ESTs 0.19 

113195 T57112 yc20g1U1 Strafcgene lung (#837210) Horn 0.19 

129462 D84239 Hs.111732 IgGFcNrafrigprofeh 0.19 

133664 XB6693 Hs.75445 hevfn 02 

126180 R18070 Hs^712 ubiquiwi-cytDdHoma c reductase; Rieste 02 

100687 H6311&KT3291 GcfcMbp (G&L18862) 02 



130064 TC7053 

101367 M12963 

132254 L20826 

105646 AA282147 

132883 AA047151 

132618 AA253330 

108931 AA147186 

131421 X64177 

107295 T34527 

103576 Z26317 

105173 AA182030 

134843 H60595 

102009 U02680 

123997 D51171 

108609 AA458652 

101300 L40391 

129717 AA481670 

108555 AA085342 

121314 AM02799 

124803 R45480 

130208 AA620556 

132888 AA490775 

132720 Z69881 

1Q2239 U26726 

115764 AA421562 

13(^58 K98654 

.122666 AA455052 

134495 D63477 

124017 F022Q2 

106925 AA491261 

115187 AA261805 

105309 AA233790 

124457 N50114 

130816 AA233763 

105795 AA369245 

134579 N23222 



Hs.181125 iinnuinogtobulnianibdagenedus^ 02 

Hs.73843 aloohd dehydrogenase 1 (cbss I); alpha 02 

Hs.430 ptestin 1 0 isoform) 02 

K&5888 ESTs 02 
H&5897 Homo sapiens mRNA; cONA DKFZp5B6P1622 (f 

H&5344 aaaptof-reJated protein complex 1 ; gamma 0.21 

H&250746 ESTs 022 

Hs2667 roetallothioneinlH 022 

Hs.80120 UDP44^y^tfpha^alactosaniinen^ 022 

Hs2631 desmogteln2 022 

HsJ364 ESTs 022 

Hs.9G061 progesterone bmalng proteta 022 

Hs.82643 protein tyrosine kinase 9 023 

Hs.78902 vofiage4epenfent anion channel 2 023 

H&32181 ESTs 023 

Hs.6445 Homo sapiens (done 3153) mRNA fragment 023 

Hs.12150 ESTs; WeaWy similar to retina! shorten 023 

Hs.1526 ATPase; Ca++ transporting; cardiac musd 023 

Hs.182538 ESTs 023 

Hs.164866 cycflnK 023 

Ha 15250 peroxisomal D3;D2-enoyl-CcA Isomerase 023 

Hs,5920 UDP-N-eceryJgJueosanTine~2^^ 023 

Hs5541 ATPase; (>h- transporting; ubiquitous 023 

Ha.1376 hydraxysterokJ (1 1-beta) dehydrogenase 2 023 

H&91011 airtermr gradient 2 pCenepus bevts) hom 024 

Hs.15984 ESTs; WeaKy similar to gene pp21 protei 024 

Hs.99387 ESTs 024 

Hs.84087 WAA0143 protein 024 

Hs.100960 ESTs 024 
Hs.37558 Homo sapiens done 23923 mRNA sequence 024 

Hs.44021 ESTs 024 

Hs.4104 ESTs; WeaJdy sirnBar to cDNA EST yfc386g7 024 

Hs.128704 ESTs 024 
Hs.16726 Homo sapiens mRNA; cONA DKFZp564A132 (fr 

Hs.17448 ESTs; WeaHy similar to Ifli ALU SUBFAM1 025 

Hs.85963 CD36 antigen (collagen type I receptor; 025 
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Table 13: H chip - Met vs Normal query -op in Mets 



Pfccy. Uiui^ii£ fffffr ptobcscft vfCTitificr ^ ^l ^nfr ^ T 

ExACQX RjiniH'^ 1 *^ A^lif ^TT ^Tfl^T' i Cff lll ti***^ WCf SHOP ! imwit^ f 



Unigcoc "fittfr Unigcoc prrift tzOc 



Ptey ExAccn UniGJD ConiptetejnUe Rafio Rtetfflormai 

102133 U2075B Hs313 secreted p ho sptap otefa 1 (osteoponfa; &457 

111307 N73988 H&37477 ESTs; Weaidy simBar to CGM41 protein O05 

103119 X63629 H&2877 cadherin3; P-carihoin (placenlal) 5.207 

131564 AA491465 Ha28792 ESTs 5.136 

119729 W69747 H&94806 WAA1062 proton 4.667 

124059 F13673 H&99769 ESTs 4398 
123987 C21171 HsS5497 ESTs: WeaHysfato to GLUCOSE TRANSPOR 4292 

128817 N47524 Hs£8491 spennkiinefepefmhe M-acetyBransferase 1364 

• 133770 M69197 Hs^42279 haptagUftHtito proteh 3323 

130412 AA408554 Hsl241572 go|gIautDaii^^suUanBlya;5 3.719 

104755 AAQ24482 Hs.9029 DKFZP434G032 protein 3702 

132676 AA283035 Hs£4813 ESTs 3£45 

134453 XTO683 Hs&m SXr(^tetBm*to9n%}oi\Y}bax4 3581 

124690 R05818 Hs.173830 ESTs 3.446 

106949 AA496805 Hs.177425 WAA0964 protein 3.42 

130724 AA370091 Hs.179680 ESTs 3.402 

128992 R49693 Hs.107708 ESTs 332 

133421 AA436560 Hs.7327 cfaudinl $255 

103047 X55990 Hs73839 rfl»nudease;RNaseA&n%3(eos&iop 1229 

102990 X51441 Hs.181062 serum amyloid A1 1149 

115429 AA284139 H&89295 EST 3.114 

129158 J05257 Ha 109 dfpepfidase 1 (renal) 3.019 

123533 AA608751 Hs^44904 ESTs; Weaidy stmQar to Oil ALU SUBFAMI 3.011 
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Table 14: H chip - Met vs Normal query - down in Mets 



Ptccyt VTofipiff Bos probcsc^ identifier nmribor 

ExAcco* Exfln oyfar Accession mnBbci» Gcnbmilc secession number 

UnigcocID: Uoigene number 

UmgeneHdc Umgenc gcoc title 



103466 Y00339 Hs.155097 
104258 AF007216 Hs.5462 
108999 AA156064 Hs.72115 
101046 K01160 
133565 H57056 Hs^04831 
101346 L76465 Hs.77348 
102085 U11313 Hs.75760 
111855 R37362 H&21351 
130320 U19495 Hs237356 
123137 AA487468 Hs.100686 
107222 D51235 Hs.82689 
102638 067319 Hs.9216 
128906 AA487557 Hs.10706 
129105 AA224351 Hs.108681 
110837 N307S6 Hs.17424 
100116 D00654 Hs.77443 
116786 H25836 H&83429 
130867 J04093 Hs2056 
132836 F09557 Ks57929 
131861 011925 Hs.184245 
106674 AA461303 Hs.7946 
109272 AA195718 Hs,S6030 
132711 N73702 Ha238927 
106569 AA455983 Hs. 11 781 6 
104636 AA0O4415 Hs.106106 
118823 N79237 H&50813 
134534 X73501 Hs.84905 
119479 W32094 H&55501 
113778 W15263 H&5422 
128482 U83908 Ha.100407 
124653 N92884 Hs.109641 
133407 AA093348 Hs.7306 
135237 AA454930 H&9691 
116250 AA480975 Hs.44829 
132617 AA171913 H&5338 
131273 AA421139 Hs.173542 ESTs 
116710 F10577 Hs.70312 
131791 S71043 Hs322» 
112483 R66534 Hs.23403 
132017 W67251 H&37331 Homo sapiens vav 3 oncogene (VAV3) mRNAO.116 
124308 K93575 K&227146 Hoix) sapiens mRNA;cDNADKFZp564J142 (fir 0.117 
114846 AA234929 Hs.44343 ESTs 0.119 
116551 D20458 K&229071 EST 0.12 
105299 AA233511 Hs.194720 ATP-ttmfing cassette; suMamfly G (WHIT 0.122 
130366 L11708 Hs.155109 hydroxysteroH (17-beJa) dehydrogenase 2 0.122 
'133806 M12759 Hs.76325 Human lg J chain gene 0.122 
104776 AA026349 HsJ31412 ESTs 0.125 
129565 X77777 Hs. 198726 vasoacfive Intestinal peptide receptor 1 0.125 
131272 AA423884 Hs. 139033 patemaSy expressed gene 3 0.127 
105774 AA348014 Hs^3412 ESTs ai28 
134604 M22995 K&865 RAP1A; member of RAS oncogene family 0.128 
134711 XD4011 Hs*8974 cytochrome b-245; beta poly^^ 0.128 
129113 AA147646 Hs.108740 DKFZP586A0522 protein 0.133 
123995 051119 Hs.100090 tetraspan3 0.133 
129168 T90621 Hs.109052 chromosoma 14 open reading frame 2 0.133 
123891 AA621103 Ha99216 ESTs; Modeiatety dmfer to Oil ALU SUB 0.135 
132694 M60830 H&5509 ecotropfc vtral integration sfte 2B 0.135 
135342 W60097 Hs,99120 DEADfrl (Asp-Glu-Ata-Asp/His) box polypep 0.135 
131510 AA207114 H&27842 ESTs; WeaHy similar to similar to 1-acy ai37 
133652 AA287383 Hs.7540 ESTs ai37 
134749 L10955 Hs.89485 raroonfcanhydraselV 0.139 
106566 AA456598 Hs^56269 ESTs 0139 



Coroptets 1 ifle 


Kflfift MflffNf)flfl3l 

rouiv HIVM iwiwi 


uuuuinu wiuijmiwo ii 


0.012 


solute carrier family 4; sodium bJceiibon 




CO 19 




Accession not Bsted In GenbanK 


0041 

U.VT 1 


E9I9 


nod? 




0.043 


cfpfnl CfllTtW DfnfPiTl ? 

WWIW WJIM |#IWIGUI Wm. 


0.054 


COIS 


0*055 


stomal oeB-denved factor 1 


0.058 


ku 1 0| fwcsoiwj omiuku vj acunnj lantern • 


0.06 


tumor rejecuon wygfln uu»Oj i 


008 
U.UO 


C8Spssfi7f apoptosis-related cystein© pr 


0.063 


ESTs 


n riRR 

U.U03 


bo is 




ESTs; Weakly stmBar to semaphorin F [H. 


0.069 


ac&i; gamma 2; smooth muscle; enteric 


O071 


tumor necrosis factor (Pjgand) superCamj 




UDP glycosyftransfeiase 1 




aui (Uiuaupi utoy iHHiwiuy o 


n 07R 


K1AA0929 protein Msx2 kitacfog nucfea 


O081 


DKFZP586D1519 protein 


0.084 


ESTs 


0.088 


ESTs 


0.091 


SGfCfcl 


0.092 


ESTs 


0.093 


ESTs; WeaWy simflar to long chain fatly 


0.094 


H. Sapiens mRNA for cytoteratin 20 


0.095 


ESTs 


0.096 


ESTs 


0.098 


programmed col death 4 


0.102 


ESTs 


0.106 


secreted frtzzfed-reJated protein 1 


0.108 


ESTs 


0,109 


ESTs 


0.111 


carbonic anhydiase XII 
ESTs 


0.112 

aii3 


ESTs 


0114 


immunoglobulin afpha 1 


0114 


ESTs 


0.115 
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106893 AA469836 Bs.2S2S3 ESTs 0.139 

101070 10Z7K Hs.1650 dcwrweguteted In adenoma 014 

114293 Z40718 Ha20196 adenylate cyclase 9 Q.14 

113966 W8660O H&9842 ESTs ai41 

101165 11&72 Hs.170037 arji hydrocarbon receptor a 145 

131492 AA393876 Hs.1255 rajctearpece|toajbfem^2;groupF;m 0145 

133889 AAQ99391 Ha211582 rnyosk; Bght polypeptte kinase 0.145 

120914 AA377254 HsJ97107 EST 0147 

118771 N74690 H&50547 ESTs 0.149 

105496 AA256323 Ks25264 DKFZP434N126 profeh 0.151 

131011 R41771 Hs22146 ESTs ai53 

1CS210 AM28239 Hs.10338 ESTs 0.154 

114069 Z38161 Hs.197335 plasma gtonate carboxypepfiaase 0.154 

133011 AA042990 Hs.171921 sema domain; brarainogtotxi^donEhOg); M54 

115967 AA4468S7 Hs.42911 ESTs 0.154 

102571 U60115 Homo sapiens skeJeyiriedeUM-piotdn 0.155 

100687 HG311&HT3291 Goffi-Msp (Gb±18862) 0.1f3 

132903 AA235404 K&5S85 Homo sapiens done 25186 resequence 0.155 

125832 AA628600 Hs.1 17587 ESTs 0.155 

130084 T67G53. Hs.181125 inranunogboufin lambda gene duster 0.157 

123264 AA491003 H&99824 BCE-1proteki ai59 

130919 AA291710 H&21276 cotegen; type IV; alpha 3 (Goodpasture 0.159 

103542 Z11793 Ks3314 selenoprotein P; plasma; 1 0.161 

101478 M23379 Hs758 RASp21 pfDteffiacfivator(GTPas8adJva 0.152 

108921 AA142913 Hs.71721 ESTs 0.164 

100642 HG2743^fT392S CaBesmon 1, AftSpfe6,N0n4rfuscte 0.167 

132109 AA599801 Hs.40098 ESTs 0.167 

115719 AA416997 H&5S622 ESTs 0.169 

128915 C02386 Hs.107139 ESTs 0 .171 

117634 N36421 Hs.107854 ESTs; WeaHysoraferto SODIUM- AND CHLO ai72 

129462 D84239 Hs.111732 IgG Fc birring proteh 0.174 

131328 V01512 H&25647 ^FEJmurme osteos ar coma viral onco 0.176 

130343 AA490262 Hs.15485 ESTs;Wealdysim3artoAPlCAUJXEPRGT 0.177 

115764 AA421562 Hs31011 anterior gradient 2 (Xenepusfeevis) horn 0.177 

122261 AA43&30 H&989Q2 ESTs 0.179 

108605 AA457718 Hs21103 Homo sapiens mRNA; cDNA DKFZp564B076 (fr 0.179 

109991 H09813 Hs.12896 KIAA1034 protein 0.181 

101300 L40391 Hi6445 Homo sapfens (done s153) mRNA fragment 0.181 

123080 AA485303 H&205126 polymeric Immunoglobulin receptor 0.182 

130016 AA055B11 Hs.143131 transmembrane glyccpfotein 0.186 

122666 AA455052 H&99387 ESTs 0.188 

105453 AA252893 Hs.9(H)1 ESTs 0189 

108980 AA151676 H&33455 pepu^arpjrunedelmhase: typeD o!l9 

100248 D31888 Hs.78398 WAA0071 prate* 0.192 

1300% M195260 K&206738 ESTs; Moderate^ siniar to ID) ALU SUB a 192 

110882 N38001 Hs.17348 ESTs; WeaWy simtor to lilt ALU SUBFAfvti 0.193 

131676 C20785 Ks30514 ESTs 0.195 

111029 N54792 H&24697 cytitftne nwnophosphate^celyJne^ 0.196 

131257 AA256042 Hs£4908 ESTs 0.196 

133348 723537 Hs.7149 ESTs 0.196 

133784 AA214305 HsJ6173 ESTs 0.196 

113863 W68388 Hs£1288 ESTs; WeaMy sfrnflar to K1AAQ704 protein 0.197 

103158 X67235 Hs.1 18651 tematopdefcaDy expressed homeobax ai98 

102347 U37518 Hs33429 tumor necrosis factor (Qg^ 02 

111351 N90223 Hs^3332 ESTs 02 

123495 AA599850 Hs.106747 ESTs; Weatiy simiar to similar to BPTU 0.2 

123802 AA620448 Hs.61408 Homo salens done 24760 mRNA sequence 0.2 

129243 H88033 Hs.109727 KIAA0733 protein 0.2 

130219 R77539 Hs.15285 ESTs 02 

131171 H04644 Hs.167619 ESTs; Weakly similar to I HI ALU SUBFAMI 02 

133746 U44378 Hs.75862 MAD (moffiers agair^ctecapeniapJegic; Dr 02 

116459 AA621399 Ks.64193 ESTs O201 

109613 F03031 K&27519 ESTs O202 

133435 T23983 Ks.7365 ESTs 0.202 

103002 X52001 Hs.1408 endomeSn3 O204 

125153 W38294 Accession not feted in Genbank 0.204 

131919 AA121266 H&34641 ESTs 02D4 

100749 HG3521-HT3715 Ras-Related Protein Raplb 0.205 

105085 AA147537 Hs.4811 ESTs OJ2Q8 

124571 N67470 Hs.173074 DKFZP56401863 protem 021 

129519 AA298786 Hs.1 12242 ESTs 0.21 

116724 F13665 Hs*5641 ESTs 0.21 

132932 T15482 Ha.6093 ESTs 021 

113803 W42789 H&31446 ESTs 0211 

110792 N24899 H&6630 ESTs 0.212 

105178 AA187490 HsJ21941 ESTs 0212 
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107295 734527 H&80120 UDP4kusty1*^ 0212 



115282 AA279112 H&88594 ESTs 0213 

115839 AA429Q38 Hs.40541 ESTs 0213 

103211 X73079 HsJ205126 pdymeric [mmunogtobulln receptor 0214 

108604 AA099820 Hs.49696 ESTs 0215 

105173 AA182030 H&8364 ESTs 0217 

108539 AA084677 Hs54558 ESTs; WeaMy simfer to protein B [Rsap 0217 

109984 K09594 Hs.1 0299 ESTs 0217 

133536 Y00264 Hs.1 77486 amyloid beta (A4) precursor protein (pro 0217 

129965 T71333 Hs.1 3854 ESTs 0219 

114542 AA055768 Hs.122578 ESTs 0219 

132982 L02326 Hs.198118 {mrrunogfobulin lambda^ce polypeptide 2 022 
101809 M86849 Homo sapiens connexin 26 (GJB2)mRNA,co0222 

105795 AA369245 Hs.17448 ESTs; Weakly similar to III! ALU SUBFAM1 022 

132119 H99211 Hs.40334 ESTs 0222 

132733 R25385 Hs.123654 WAAD824 proteto 0222 

109415 AA227219 Hs.1 10826 trinucteo«e repeat containmg9 0222 

113083 T40530 Ha8241 ESTs; WeaHy similar to heat shock prote 0223 

107053 AA600147 H&5741 ESTs; Weakly sinter to NADH^qrlochrome 0224 

103653 Z7Q295 H&32966 guanylate cyclase activator 2B (uroguany 0225 



104613 AA001049 Ks24713 Homo sapiens mRNA; cDNA DKFZp586G0123 (f 0225 



126180 R18070 H&3712 ubfepirnokytoduoniecreAic^ 0227 

132015 011900 K&3731 ESTs 0227 

130616 AA233763 Hs.1 6726 Homo sapiens mHNA; cDNA DKFZp564A132 (fr 0227 

132863 AA047151 Ha5897 Honiosap^rnRNA;cONADKFZp586P1622(f 023 

123169 AA488892 Hs.1 04472 ESTs; Weakly slrnaar to Gag-Pd pdyprot 0233 

115187 AA261805 Hs.44021 ESTs 0234 

116787 H28581 Hs.1 5541 ESTs 0234 
113195 T57112 yc20g1U1 Stratagenelung (#937210) Horn 0235 

130707 W45457 Hs203559 ESTs 0235 

124803 R45480 Hs.164866 cydinK 0235 

116844 H64938 Hs38331 ESTs 0235 

102759 U81607 Hs.788 A kinase (PRKA) anchor protein (gravin) 0238 

130584 AA009839 Hs.180841 tumor necrosis factor receptor superfami 0238 

133240 031161 Hs.68613 ESTs 0238 

132952 AA425154 Hs.61426 ESTs 0239 

132720 2G9881 H&5541 ATPase; (>m. transporting; ubiquitous 024 

131734 D62985 Hs31297 ESTs 024 

111890 R38678 Hs.12365 ESTs 0241 

102325 U35139 Hs.50130 necdin (mouse) homotog 0244 

104968 AA084602 Hs2SK9 ESTs 0244 

105674 AA234755 Hs214742 CDW52 anfigen (CAMPATH-1 anfigen) 0244 

120519 AA258585 Hs.1 29897 cadherin 1 9 (NOTE: redefinition of symbo 0244 

134675 AA250745 H3.87773 protein kinase; cAMP-dependent; catalyll 0244 

130642 M63438 Hs.156110 rnirranwgtobufin kappa variable ir>8 .0245 

134418 R78190 Hs.62933 ESTs; Weakly sfrniar to cDNA EST EMBLT0 0245 

115137 AA257976 K&56156. ESTs 0245 

131713 X57809 Hs.1 B1 125 imrminoglobulln lambda gena cfuster 0246 

108931 AA147186 Hs250746 ESTs 0246 

106609 AA458652 Hs32181 ESTs 0248 

115559 AA393810 Hs.41067 ESTs 025 

133985 L34657 Hs.78146 pbtoJet/er^e^r^adheskmmolec 025 

134088 D43636 Hs.79025 WAA0396 protein 025 

134487 R38185 H&83954 Homo sapiens unknown mRNA 025 
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Table 15: I chip - Met vs Normal query - up in Mets 



Pfccy; Unique Eos probcsct identifier ininibcr 
UmgBie Title Ifirfgcws gnie tiflc 



Ptey 


ExJ\ccn 


UniGJD 


TBte 


Outfit HiilMnim tl 


319379 


T91443 


Hs.193963 


ESTs 




321920 


N63915 




EST duster f not bi LbriGenfil 


11 Q 


314522 


AI732331 


Hs. 187750 


ESTs; Modeiate^sbritotoOnAUJ CIA 


723 


315720 


AW291875 


Hs.163500 


ESTs 


646 


308010 


AM39190 


Hs.181165 


euJcarycfictranstafion efangafion factor 


5.76 


313774 


AW136836 


Ks.144583 


ESTs 


5.01 


30)734 


AW205197 


Hs24Q951 


ESTs 


338 


337895 






CH22_BvtAC0{K50aGB«CAN56-2 


198 


312339 
331644 


AA524394 




EST duster (not in Un&ate) 


346 


T99544 


Hs.173734 


ESTs; Weakly similar to IIS ALU CLASS B 


153 


324643 


AI436356 


Hs.130729 


ESTs 


352 


324302 


AA543008 


Ks.136806 


ESTs; WeaHy strife to DO ALU SUBFAMl 


3.41 


314912 


AI431345 


Ks.161784 


ESTs 


333 


319403 


T98413 




EST duster (not fn UnEene) 


332 


308676 


AI761036 


Hs.163848 


ESTstngJe4on{mrtInUnK5en8)wahexon 


327 


331B58 


AA421163 


ESTs 


322 


315178 


AW362945 


Hs.162459 


ESTs 


321 


321354 


AA078493 




EST duster (not tn Unftene) 


3.18 


337898 






CX2ZtfMXX)550Q.GBiSCAlt5&S 


ais 


322682 


A1110S79 




ESTcfaBter(notInUniGene) 


3.15 


313197 


AI738851 


Hs222487 


ESTs 


3.1 


308991 


AI879831 




EST singteten (not In UraGene) with exon 


348 


310016 


AW449612 


Hs.152475 


ESTs 


345 



145 



WO 02/068677 



PCT/US02/06001 



Table 16: I chip - Met vs Normal query - down in Mets 



Pkcy: Unique Eos probcsct ident ifier number 

UmgendD: Unigcnc number 

Umgene Title Um'gene gme title 



Pkcy ExJUxn UniGJD title Ratio Met/Normal 

303041 AF127035 EST cluster (not hUnJGeneJwiftexonh 0.02 
302360 AJ010901 Hs.1S8267 mucin 4; tracheobronchial 0.03 
301948 AA344647 Hs.116724 aMoJeeto reductase fantilyl; member B11 0X3 

336091 CH2SLFGB4ES.K9.3 0.04 

333657 CH2ZJ=GENES^41J2 O04 

333658 CH22J=GENES.241_4 O04 
333737 CH22J=GENES^61J O05 
333656 CH22J=GBJES.240_4 0X5 
302347 AFO39400 Ha 134659 cWoride chaxtnet caWum acflvatetfcfem 0.06 
336084 CH22JH3ENES.688J3 0X6 
330385 AA449749 H&31386 ESTs; Highly similar to secreted apoptos 0X6 
304487 AA434241 EST singleton (not In UniGene) with exon 0.07 
302292 AF067797 EST duster (not in UniGene) wfih axon h 0.07 
334030 CH22J=GENEa320 - 2 0.07 
332859 CH22J=GENES^7J2 O07 
333654 CH2LFGEfESw240JJ O07 
303270 AL120518 Hs.105352 ESTs 0.08 
320352 Y13323 Hs.145298 dsWegrin protease 0X8 
333637 CH22.FGENES.229J2 0X8 
324094 AA382603 EST duster (not in UniGene) 0.08 
32)580 U6705B Hs.168102 Human proteinase activated receptor-2 mR O08 
330622 X63597 H&2996 sucrase^somaflase 0.08 
331441 H75860 r&39720 ESTs 0.08 
306601 AI719930 EST singleton (not h UniGene) with exon 0.09 
323770 AA722425 EST duster (not h UniGene) 0X9 
335188 CH22_FGBES.507_3 0X9 
333730 CH22JGENES.258J O09 
304480 AA430373 EST smgietm (not in UniGene) with exon 0.09 
336081 CH2?_FGENES.688J0 01 
332071 AA598594 Hs.1 12475 ESTs - 01 
318538 N2B625 Ks.74034 cavecfin 1; caveofee protein; 22kD 01 
311331 AJ679622 H&32225 Ininiuroojobunn alpha 1 01 
319668 NM_002731 EST duster (net in UniGene) 011 
332567 N23730 H&25647 v-fosFBJ murine osteosarcoma viral onco 011 
319395 AW062570 Hs.1 3809 ESTs 0.11 
315594 AI983437 Hs.155145 ESTs 011 
321539 N98619 Hs.62461 ARP2 (adkwetated protein Z yeast) ho 0.12 
333647 CH22JH3ENESJ235-2 M2 
333588 CH22J=GENES206^2 012 
321286 AI380940 EST cluster (not h UniGene) 012 
320727 U96044 EST duster (not In UniGene) 013 
335687 CH2LFGENES£98_2 013 
324611 AA743462 Hs.1 55337 ESTs 0.14 

335115 CH22J : GBtESA9f>_2 0.14 
324660 AA541644 Hs.186044 ESTs 014 
337951 CH2a_9AACC05500.GB4SCAN.94-1 014 

302332 AI833168 Hs.184507 Homo sapiens Chromosomal 6 BAC clone CIT 0.14 

300921 AW293224 Hi232165 ESTs 0.14 

333646 CH2ZJ=GENES.234_2 0.14 

335116 CH2?J=GENES.496J . 014 
320211 AL039402 Hs.125783 DEME-6 protein 0.15 

336092 CH22_FGBES£893 015 
330673 D57823 H&92962 Sec23 (S. cerovisiae) homotog A 0.16 

303042 AF129532 EST duster (not In UniGene) with exon h 0.16 
337954 CH22_0VtACOO55OO.GBISCAN^M 016 
336645 CH22_fGENES^6-1 0.16 
335651 CH22_FGENES590^ 0.16 
314499 ALD44570 Hs.147975 ESTs 017 
336124 CH22JK3B<ES.701J9 0.17 
315199 AA877996 Ha.125376 ESTs 0.17 
324525 AW044647 Hs.196284 ESTs 0.17 
320825 NAUJ04751 EST duster (not in UniGene) 0.18 
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AA377072 Hs.1 29792 


Hon© sapiens OuomDsomB 16 BAG ctona CTT 








CH2£J3£NESJj88_12 


0.18 


JaayQfl 






yva MA ptf^MMMk ruin. «% 

CH2£JGBiES239L2 


ais 




W44372 




EST duster (not tn UnSene) 


0.19 




AWC37973 Hs.125731 


ESTs 


0.19 




* ft IMA * A 

AI493046 


Ks.146133 


ESTs 


ai9 




U39840 


Hs.105440 


hepatocyte roaiear factor 3; alpha 


ai9 








CH22_F3ENE&231_2 


0.19 




AA843388 


Hs.190586 


ESTs 


02 








CH22_F&3tiES224_2 


021 




C75094 


Hs.198839 


ESTs; Highly shfiarto KG22 jftsaptens) 


021 




T73428 


K&191264 


ESTs 


021 




AA431366 




EST duster (not tn UniGena) 


021 


333835 






CH22_FGB« r E&228_2 


022 




AK08999 


Hs207570 


ESTs 


022 




N3&5585 


Hs.146246 


ESTs 


022 


9*MT4D 


AI557019 


Hs.1 15457 


ESTs 


022 


335652 






CH22_FGBE&590_3 


022 


307783 


AI347274 




EST stagfetan (notin UnXSene) wffli axon 


022 


331344 


AA357927 


HS.702CS 


ESTs 


022 








CH22_FGENESJ388_17 


023 




D83824 


H&185055 


BENE protein 


023 


335592 






CH2£/GaESJ96JT 


023 


333593 






CHZLFGENES^IOJ 


023 


335667 






CH2^J=GBESJ90J8 


024 


314853 


AA729232 


Hs.153279 


ESTs 


024 


320244 


AA296922 


Hs.129778 


— JLj.t-jJiiLtjLjili— L t .t ,-.««» rl,-i 

gasaotrasstmi peptxie 
ESTs 


024 


300601 


AJ762130 


H&165619 


024 


305080 


AA641485 




EST ^xjtefen (not in UniGene) wtlh exen 


025 


335189 






CH22JGBIE&507L4 


025 
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Table 17: B survivor vs Mets - Up in B survivor 



Pkcy: Unique Eos psobeset identifier masher 

ExAcau Exemplar Accession number, Genbsnk accession number 

lAugeselD: Unigene number 

UmgeneTnk: Unigene gene tifle 



Pkey ExAccn UnlGJD Compete Trtla Ratio BS/Met 

101006 J04132 Hs57087 CD3Z antigen; zeta polypeptide (TTT3 com 728 

114173 Z39Q50 Hs21963 ESTs 6.13 

130284 XB220B Hs.153961 ARP1 (acfeHBteted protein 1; y^si) ho 577 

100787 HG3872-HT4142 Imnurtwgtobufln Gamma Heavy Chain, V(6)DJc Regions (Gb:U13200) 5.63 

132461 AA4Q5775 Hs.49005 hypotttefcai protein 552 

1338C8 M12759 Hs.76325 Human tg J chain gens 5.46 

133747 D86972 Hs.75863 WAA0218 gene product 5.45 

123328 AA496968 Hs. 105403 EST 5J28 

132571 X763Q2 H&54649 putafivenuctoacM binding protetnRY 525 

132018 AA293194 K&3737 ESTs 522 

100186 D17516 Hs.4748 adenylate cyclase acfivatfng polypeptide 5.14 

107155 AAS21202 Hs.7946 DKFZP586D1 519 protein 5.1 

103566 Z22555 Hs.180616 C03S antigen (collagen type I receptor; 5.06 

113355 779203 Hs.14480 ESTs 459 

129040 U38864 Hs.108139 zinc finger protein 212 456 

130214 H7B003 Hs.15266 ESTs 433 

129550 AA480991 Hs.113G25 ESTs 452 

129704 WB1301 Hs.12064 ubiquffin specific protease 22 451 

116425 AA609574 H&51483 ESTs 4.77 

105166 AA179787 H&30570 pdygJutamine bwfing proteh 1 4.65 

118765 N74442 Hs.183696 ESTs 45 

108999 AA156064 Hs.72115 ESTs 457 

112756 R93908 H&35258 ESTs 454 

111655 R16884 Hs.187462 ESTs 4.48 

119392 T90672 H&238859 ESTs 4.42 

131957 AA6090C8 Hs.183232 ESTs 4.41 

129275 D82061 Hs. 109993 Ke6 gene; mouse; human homoJog of 4.4 

113634 T95C85 Hs.125182 ESTs 4.4 

127187 AA297138 Hs207422 ESTs 4.32 

101147 L13266 Hs.105 gkrtamate receptor; tonotropic; N-methyt 45 

134901 S78873 Hs50875 RAB Weracfing factor 426 

100896 HG4593W4998 Sodium Channel 1 424 

100687 HG311SHT3291 Goffi-Mbp (Gb:L1fl862) 421 

129758 AA599552 Hs.183770 Homo sapiens mRNA; cDNA DKFZp566P2346 (f 4.19 

105440 AA252243 Hs22851 ESTs 4.16 

131551 AA127867 Hs28608 ESTs 4.15 

113761 T99373 Hs. 159786 ESTs 459 

105897 AA401091 ESTs 457 

129495 AA382529 Hs239676 ESTs 456 

103436 X98208 Haptens mRNA for UV^ repressed sequen 453 

104673 AA007633 Hs20010 ESTs 453 

128886 136720 Hs.106880 bys&vfflce 452 

100702 HG3236-HT3413 Neurofifarornatosls 2 Tumor Suppressor (Gbl27065) 359 

123547 AA608820 Hs. 124085 WAA0921 protein 358 

134877 AA455241 Hs50527 ESTs 357 

123650 AA609332 Hs.180696 ESTs 354 



106482 AA451672 Hs.108824 ESTs; WeaWy simBarto cDNA ESTyk415c1 354 
101909 S69265 Homo saptens mRNA for PLE21 pfoteh; com 333 

108390 AA075070 zm86b6\s1 Stratagene ovarian cancer (#93 

LYMPHOCYTE ANTIGEN LY-6A2/LY-6E.1 PREC 353 



135403 U06643 Hs59923 tecfin; galactostfe-bindmg; soluble; 7 3.89 

121038 AA398536 Hs57365 ESTs 358 

128496 TB3496 Ks.100610 ESTs 3l86 

108785 AA128946 ESTs 356 

119838 W79499 Hs58580 ESTs 355 

130109 L12060 Hs.1497 refincfc add receptor gamma 354 

134538 U79288 Hs55053 WAA0513 gene product 353 

110310 H38209 Hs52728 EST 351 

110433 H49425 Hs32992 ESTs 3.73 

111834 R36138 Hs. 152458 ESTs 3.76 

130903 N27086 Hs21Q68 ESTs 3.74 



105142 AA164851 Hs.15380 ESTs; Weaidy similar to HERV-E envelope 3.73 
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U84569 Hs.153452 ctamsonB21 c^rea&gfraro2 373 
13)645 AAQ20942 Hs.17200 STAM«BpDt^icantah^SH3andrTA 3J3 
123378 AA521043 Ns.185832 ESTs 3J3 
103985 AA313880 EST185737Cotoncan±xjraa(HCC)cBfl0n 373 

112397 RS0822 H&26805 EST 372 
100980 J030S9 Hs.72931 v^nyc avian irryetocytaratosis viraJoncog 372 
1026(8 U64863 Hs.158297 progianrad ce3 death 1 37 
108974 AA151402 Hs.46531 ESTs 37 
13)192 Y12661 Hs.171014 VGFnewBgra^fafcrtnAxibte 369 
131318 X51699 Hs2558 bon8ganinMaita(ygIutei^(gla)|^ 368 
113759 T9S364 Ks.16074 Homo sapiens mRNA; cDNA DXFZp564l153 (fr 
133712 L19267 Hs.198836 dystrophia rr^otato<xmtalnIng WD repea 3£5 
134229 R15108 Hs3Q37 ESTs 365 
134241 AA300265 Hs3Q540 WAA0195 gene product 365 
1246S9 R06413 Hs.112278 arresfin; beia 1 362 
107343 U03115 Hs.103945 HiiiHnV beta T-cel receptor (TCRBV) gen 162 
128511 AA425636 Hs.10082 potassium HeRnedtefenal conductance 352 
105466 AA253412 Hs21489 ESTs 361 

131377 R41389 Hs26159 ESTs 35 

119135 R49548 Hs. 169681 deam efector aona&KontaWng 3.6 

132982 LQ2326 Hs.198118 brmnwojobulln bmbda-to pdypep&Je2 359 

128514 H84261 Hs.100843 ESTs; WeaHyslnte to similar to OT-b 356 

102396 U41804 Hs54411 putatire T VST2 receptor binding protein 355 

134945 R50247 Hs*1600 ESTs 355 

134913 X60483 Hsj91Q31 H4 hfefena famiy; member D 354 

102053 U07664 Hs37Q35 homeobaxHB9 352 

121569 AA412686 Hs37955 ESTs 352 

132560 AAC05315 Hs204524 ESTs; WeaWysmi8artoKlAA0747pnjte!n 351 

118458 N66580 Hs.161496 EST; WeaWy sim2ar to HC1 ORF (Mjnuscu) 351 

111518 RQ8160 H&222529 ESTs; WeaWy sfmfer to KB ALU SUBFAMl 351 

116795 H3885B Hs251783 EST 35 

130377 AA378316 Hs.155182 WAA1035pmtem 35 

121774 AA421758 H&98351 ESTs 3.49 

123413 AA521448 Hs.103845 ESTs 3.49 

133798 AA444115 Hs.76277 ESTs; Weakty similar to saDvary profina 3.49 

135183 X93996 Hs.239663 rrryetoioVlymphold ormfcerMneage teukem 3.48 

132479 AA477715 Hs.4953 gcSgi autoar%en; golgln subfamily a; 3 3.47 

117191 H99394 Hs.40339 EST 3.47 

130942 XB7852 Hs21432 Haptens mRNA for SEX gene 3.46 

130700 D55696 Hs. 18069 protease; cysteine; 1 (tegumain) 3.43 

131301 T17386 Hs.164501 ESTs 3.43 

100818 HG4018W4288 OptoMim^CeflAdhesfan Motecub 3.43 

103393 X94612 H&41749 proteii kinase; cGMP^depandent type li 3.43 

131337 AA228116 Hs.170204 K1AA0551 proteki 3,42 

133403 X68688 Hs.72991 zinc finger protein 33b (KDX 31) 3.42 
124728 R16231 Hs. 106620 Homo sapiens done 23950 mRNA sequence 3.41 

123168 AA488881 Hs.105218 EST 339 

123324 AA4S6932 Hs.105399 WAAD8Q9 protein 338 

106947 AA496685 H&37936 suppressor ()fvaj1eg^^(Drosc^ 338 

116717 F11065 Hs.79363 ESTs 136 
102794 U88629 Hs.173334 ELL-RELATED RNA POLYMERASE (I; ELONGATK) 

117503 N31963 Hs.44286 ESTs 333 

11222) R5Q295 Hs25703 ESTs 333 

106340 AA441792 H&22857 chofddornalrKxmlain^ 333 

106308 AA436186 Hs30662 ESTs 332 

130894 D16105 K&210 leukocyte tyrosine kinase 331 

120039 W92548 H&94385 ESTs 331 

131428 U17838 H&26719 PRdoniaiicorrtair^2:^2NFdorna{n 33 

113285 766830 Hs.162712 ESTs 33 

109458 AA232648 Hs57068 ESTs 329 

132134 AA242904 Hs.40537 prdme-rfch Gla (G^arbocygJutarracacW 329 

118964 N93330 Hs54937 Homo sapiens done 24722 unknown mRNA; p 

127621 AI218205 Ha 116204 ESTs 329 

135149 U400Q2 Hs45351 Dpase; homvne^ensffive 328 

114371 Z41835 Ha27810 ESTs 328 

130043 AAQ55404 Hs.193953 ESTs;WeaWysirnfartolIl!ALUSUBFAMl 327 

121347 AA405181 H&97972 ESTs 325 

105754 AA302657 Hs. 192028 ESTs 325 

121327 AA404288 Hs.173125 rjepfidytproiyi isoni^ 325 

111204 N68295 Ks37982 ESTs 325 
120949 AA397830 Hs38347 ESTs; Weakly simfer to GUOMA PATHOGBIE 
130024 U15197 H&241560 Human histo-toxJ group ABO protein mRNA 324 

12J005 T61449 Hs. 193727 ESTs ~ 324 

121067 AA3S8662 H&973Q2 ESTs 324 

120996 AA398281 Hs.143684 ESTs 323 

117101 H94043 Hs24341 DKFZP586J1419 proteh 323 
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130708 U40490 Hs.16136 nioflnamkfe nudeoSde tianshydrogenase 123 

130270 L40399 Hs.153820 hypomeocaJraotein 122 

131605 AA256220 H&29383 ESTs 122 

100854 HG4194+1T4464 SodiurrtHydrogen Exchangers 122 

123026 AA481072 H&99743 ESTs 121 

108323 AA070204 zm88bls1 Strategene neura^theOim (#9 32 

104259 AF007833 Hs.159265 Homo sapiens kmppekelated zktc finger 32 

133711 J0413O Hs.75703 sniaJI inducfcte cyWto 12 

112261 R52145 Ha25894 ESTs; Hjght/sknBartohypoheScal pro 3.19 

119529 W38053 Accession not feted in Genbank 3.19 

122386 AA446221 Hs£092 F-box protein containing teurina-rfcri re 3.19 

109157 AA179161 Hs.73562 ESTs 3.19 

119903 W85707 Hs.75936 erymrocyte membrane proteh hand 45 (d 3.18 

127452 AA491317 aa65c01 jI NCLCGAPJ5CB1 Homo sapfens cD 3.18 

124229 H62793 Hs.221892 ESTs 3.18 

129221 AA417128 Hs.1 09571 translocaseof hrwniaochcmdrialnienibr 3.17 

133185 AA481404 Hs.6686 ESTs 3.16 

121479 AM11911 Hs.98110 ESTs 3.16 

133872 T79868 Hs.1 80803 hypofoeficaJ protefn 3.16 

132504 U12897 H&5Q22 Imprinted In Pnader-WSi syndrome 3.16 

103089 X60382 Hs.1 79729 collagen; type X;aipha1 (Schnid metaph 3.15 

129654 AA019943 Hs.1 18463 H^a^mRNA for unknown Bverorphan 3.15 

117295 N22360 Hs.43153 ESTs 3.15 

107349 U48224 Hs.1 58321 beaded Sament stnjdural protein 2;ph 3.14 

103451 X99459 Hs.1 54782 adaptor-related protein complex 3; sfema 3.14 

114854 AA235056 Hs.1 20244 ESTs 3.14 

121044 AA398551 Hs.97374 ESTs 3.13 

128582 U22963 Hs.101840 major hsstocompafblBy complex; dass 3.13 

112598 R78565 rte.138395 EST 3.13 

113170 T54342 Hs.222506 ESTs 3.13 

111714 R23146 Hs33466 ESTs 3.13 

111809 R33616 HSu24688 EST 3.12 

115249 AA278961 Hs.71124 ESTs 3.11 

103228 X75546 H&230 ebromoduDn 3.11 

129944 100389 Hs.1 381 cytochrome P450; subfemfly ! (aromalcc 3.11 

107927 AAD28915 Hs^37709 EST 3.11 

130297 H94949 Hs.171955 trophhirvassisfing protein (tasfin) 3.1 

125742 H81181 Hs.183654 ESTs; WeaHy slmQar to unknown [Sxerev 3.1 

134802 L35546 Hs.89709 glutamate-cystelne Dgase (gamrrta^iufamy 3.1 

112560 R72293 Hs.6179 Homo sapiens mRNA; cDNA DKFZp586K2322 (f 3.1 

129266 AA343881 Hs.203061 sudD (suppressor of bfanD8; Aspergillus n 3.09 

126982 AA211419 small lndMabfecytDktoeA5(RANTES) 3.09 

131594 H29723 Hs£9261 ESTs; WeaJdysfenSar to serine protease 3.08 

134910 AA431320 Hs.9100 ESTs 3.08 

103505 Y09912 Hs.33102 transcription factor AP-2 beta (acfivafi 108 

110525 H57330 Hs.37430 EST 3.07 

123276 AA491270 Hs.1 87946 ESTs 3.06 

130519 H91819 Hs.10669 ESTs; Moderately simiar to K1AA04C0 pi 3.06 

126621 AA192638 zq01h08.r1 Strafegene muscle 9372C9 Homo 3.05 

134327 AFQ06041 Hs.178743 deam-assooated protein 6 3.04 

103513 Y10209 H^aptens mRNA for CD3L protein 3.04 

131243 R16667 H&24752 spectrin SH3 domain binding protein 1 3.04 

115187 AA261605 Ks.44021 ESTs 3.04 

107543 Z43703 Hs.4552 Homo sapiens HRWFB2157 mRNA; partial cd!04 

134051 S67070 Hs.78846 heat shock 27© protein 2 3.04 

113461 T86737 Hs.193536 ESTs 3.03 

130490 X57522 Hs.158164 ATP-btrcfing cassette; sub-family B (MDR/ 3.03 

128843 AA234141 HsJ203QO4 katanin p80 (WD4toontaInhg) suburut B 3.03 

100941 HG862-WTB62 Transifion Protein 2 3.03 

122268 AA436855 Hs.178202 ESTs 3.02 

107425 W26719 Hs30204 ESTs 3.02 

130930 U19261 TNF receptor-associated factor 1 102 

132958 W90398 Hs.6147 WAA1075 protein 102 

100973 J02888 Hs.73956 NAD(P)H menadione oxictoreductase 2; dtox 101 

104924 AA058532 H&28774 ESTs 101 

129998 Y10055 Hs.1 62808 pncsprwtoc^ide^dnase; cataJyfc;de 3.01 

130023 X13461 Hs239600 calnxxMivSke 3 101 

129536 M33493 Hs.1 84504 tryptasa; alpha 3 

112015 R42B36 HsJ23198 ESTs 3 

103036 X54925 H&83169 matrix rrcrfafopoleinase 1 (irrtersW 2S9 

100756 HG356&HT3768 ZIncRr^Proteto(GfcM88357) 259 

103425 X97301 Haptens mRNA for Ptg-1 1 protein 239 

118291 N63076 Hs.138746 EST 238 

125877 H15229 ym30g04.r1 Scares mfent brain 1NiB Homo 

repefifive etement * u mRNA sequence. 2.98 

101371 M13232 Hs36989 coagulafion factor VH (serum prothrombi 298 
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102958 X15675 H&93174 HunHnentfpgenousi^TDVtepHEl (B*V9) 257 

121183 AA400138 Ks577Q3 ESTs 2J97 

119241 T12559 rte221382 ESTs 258 

115067 AA253458 Hs51299 postrrefcScseojega&on tnarased 2-to Z96 

12S196 AAC84394 znOSglOlsl StratagenehKT neuron (383723 258 

111642 R16153 Hs.128740 OTs;H|gMys!ntotD!»a>5pisajtos 255 

100833 HG 4638-H T5050 Spfceoscma] Proton Sap 49 255 

129370 AA287879 Hs.1 10796 ESTsiKtodaatBJy^ntotoGTP-Widra 254 

128915 C02386 Hs.107139 ESTs 254 

101888 NB6233 Hs52891 glutaftm&JninsfeiasebM 254 

1243S4 N29724 garnmafc-aibpfin 253 

103559 Z19585 Hs.75774 flmjatapcraffn 4 253 

107882 AAQS630 Hs.17801 ESTs; Moderate^ dmilar to sertne^r^ 253 

134919 T39639 Ks51142 rWype sp^ repertory protein (FUS 252 

110293 K3Q258 H&37165 collage* type DC; alpha 2 252 

132433 AAD82546 Hs.48516 ESTs 252 

127347 AA428350 ESTs 252 

121976 AA42S807 Hs38S32 ESTs 251 

13303 AA135492 H&6318 ESTs; KHgWyslnQartDperaxBomaJshor ' 251 

133413 S72M3 Hs.73133 metaftrfhfaneii 3 (growth Wi&itory fee 251 

111694 R22035 Ha23331 ESTs 251 

128389 F12681 H&205300 ESTs 25 

102464 U49260 Hs5828 mevabnata(dvho^)decaroaxy^se 25 

135358 C21431 Hs59486 ESTs; WeaJdy similar to aratarl (H^apfe 25 

108661 AA113287 Hs559Q5 ESTs; Weafy simSar to PTBASSOftATED S25 

102185 U20230 Human guanyi cyclase C gene, partial cds 259 

122071 AA431787 Hs58762 EST 259 

102040 U06088 Hs.1 59479 ga]actosambe(N-ac8tyl}^uSatesu& 259 

115689 AA410645 Hs.1 99014 ESTs 258 

135110 T15817 Hs.193788 nitric oxide synthase 2Apnducrote;hep 258 

118729 N73717 Hs.161526 EST 258 

129518 AA369807 Hs.1 12238 ESTs 258 

125788 R74309 Hs.44499 snaffa3RK-rahfaclor2 257 

128650 U57971 Hs.103124 ATPase; Ca+* transporting; plasma membra 257 

125938 H30751 Hs.182859 lifeguard 257 

100779 HG3731-KT4001 Imrounogtobu&i Heavy Chain, Voire Regions (Gb±23566) 257 

104451 M13299 Hs.1 021 19 blue cone pigment 256 

133539 M21574 Hs.74615 pSatel^d^edgrowm tactor receptor; 256 

119506 W37833 Hs55563 ESTs 256 

126568 AA190515 zp85d12j1 Stratagene Hela cefl s3 93721 256 

134184 X53742 Hs.79732 fibuQnl 256 

127633 AI339609 Hs.1 52733 pofasskim voltage-gated channel; Isk-rel 256 

128716 AA045978 Hs.1 73511 NADH dehydrogenase (ubiquinone) Fe-S pro 256 

107135 AA62D732 Hs^3247 ESTs 255 

117748 N47317 Hs.141858 ESTs 255 

1240% F04143 Hs.151032 Homo sapfensctone 23856 unknown mRNA;p 255 

135120 M449841 Hs.108300 NOfra(neg^rco^oftransCTqjfio 254 

102156 U17977 HSUl7977HumnftHoiia3tcDNA Hsaptera 2M 

129418 AA401401 Hs.1 1127 PET1 12 (yeast hcrnobgHIko 254 

103222 X74795 Hs.77171 mlnfchrornosome mahtenance deficient (S. 254 

12145 W38001 Accession not listed In Genbank 253 

100560 HG2228WT2305 Crystafiin, BetaB 253 

105370 AA236476 H&2Z791 ESTs; Weakly similar to transmembrane pr 253 

127038 AM68598 ESTs 253 

128788 AA029073 Hs.105685 ESTs 253 

119523 W38041 ' Accession not fisted In Genbank 252 

12S436 N31224 H&211579 niekronaoftesionniaieafe 252 

126559 R15866 HS.1702S3 tumor pnjteii 53^?^ proto; 1 252 

118183 N59287 Hs.48361 EST 252 

101298 140387 Hs.1 18633 Z^o^adenylatesynmetasefke 251 

131830 U33054 HsJ2959 G proiein^xxjp^ receptor kinase 2 (Dro 251 

124173 H41281 Hs.107619 ESTs 251 

102295 U32581 Homo sapfera KIAA0421 mRNA; partial cds 251 

129719 N66398 Hs.167766 ESTs;Moa^rateVslinIIartoPic^(XO[ 251 

126573 AA482Q23 Hs.155218 E1&«kDa^ssocfeted protein 5 251 

125477 AE70093 H&234642 aquaporin3 251 

108492 AA451898 Hs.7922 ESTs; WeaHy similar to cordairssln^ 

p1$ an RNA poiyrnerase (1 eJongafion fa 25 

132881 T86118 Hs58875 ESTs 25 

114733 AA133778 Hs55734 ESTs 2.79 

104818 AAC01611 Hs.1 86434 ESTs 279 

134137 F10045 Hs.79347 WAA0211 gene product 279 

133212 U8S79 Hs57846 teutocyte to/tte receptor; aibramflyB 278 
100882 HG44604fT4729 tinrramcgiDbuan Heavy Chain, Vd]c Regions (Gbi23554) £78 

104756 AA024622 Hs.15813 solute canto family 22 (organic caflon 278 

129861 N69507 Hs.129849 DKFZP564M182 proteh 278 
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120824 AA347548 Hs£6876 ESTs 278 

100684 HG3107-HT3283 Plasma Membrane Cakiim Pump Hpmca2a 2.78 

121789 AA423970 Hs.178111 ESTs 2.78 

101647 M59341 Hs.118200 coIonysflmuIafingfadDr2receptDr.be 2.78 

113722 T97957 Hs202S48 ESTs; W^ldy similar to altmativdysp 2.77 

115107 AA256371 Hs.186645 ESTs 277 

111464 R05518 Hs.19521 ESTs 2.77 

108446 AA079120 znfl&etsl Slratagene colon HT29 (&937221 277 

123921 AA621329 Hi250671 Hu DNAseq firm done 1163J1 on chr22q13 

prot (sfmflar to mouse Ceisrf; rat MEGF 277 

134445 M59488 Hs*3364 S1C0 calcium-binding protein; beta (neur 276 

114132 Z38688 Hs24192 ESTs 276 

120500 AA256430 Hs.132525 ESTs 276 

101860 M95610 Hs37165 cotegen; type IX; aJpha 2 276 

134430 H52105 Hs*309 K1AA0747 protein 276 

124152 H27216 Hs.107635 ESTs 276 

132268 AA058833 Hs23445 ESTs; Weakly smlrto similar to M. muscu 276 

116257 AA461493 Hs.88537 ESTs 276 

102438 U46570 Hs.7733 tetratapeptkfe repeat dornaki 1 275 

122393 AA446334 H&99064 ESTs 275 

107653 AA010210 Hs.47041 ESTs 275 

123674 AA609473 Hs.105187 ESTs; Moderately similar to kinesin Ike 275 

129858 TB6906 Hs.12970 ESTs 275 

130117 U06641 Hs.150207 UDP grycosyftransferase 2 femJy; porype 275 

133464 M13982 Hs.73917 MeneuWn4 275 

127039 AA233366 H&256491 ESTs 274 

128318 AA418202 Hs.13810 ESTs 274 

123363 AA504818 Hs.171279 ESTs 274 

127654 AA649249 Hs.75640 natriurefc peptide precursor A 274 

132067 L2D860 Hs. 178382 glycoprotein lb (platelet); beta pdypep 274 

125664 AA948418 H&25744 ESTs; Weakry dmter to Ydr41 2wp [S.cere 273 

132354 L05187 H&21 1913 srnafl prcfoe-nch protein 1A 273 

101568 M33764 Hs.75212 c^irtedecarooxyiasel 273 

101438 M20777 Hs.159263 Homo sapiens; alpha2 (VI)co8agen 273 

116233 AA479082 Hs.61142 ESTs 273 

122194 AA435882 Hs.97531 ESTs . 272 

113995 W88466 Hs^2010 ESTs 272 

124251 H68286 Hs.107924 ESTs 271 
120583 AA281304 Hs.78614 comptemertt component 1; q subcomponent b 2.71 

134958 072507 H&234216 Hunan 40871 mRNA partial sequence 271 

124280 H85835 Hs.100058 tfhy*opyrimloTnase-Dlce4 271 

130113 M84673 Hs.1499 healshocktranscnpfen factor 1 271 
106588 AA456612 fe25682 ESTs; WeaHy smtr to PHOSPHATTDYLETHANOL 271 

132023 F01927 Hs.3743 ESTs; WeaWy similar to proflne-nch pro 27 

112284 R53558 Hs.26052 ESTs 27 

107897 AA026240 Hs.61387 ESTs 2.7 

122610 AA453598 Hs.99336 ESTs 27 

119070 R27788 Ks.52302 ESTs 27 

103491 Y08836 Honio sapiens mRNA for HfWite protein 27 

108225 AA058B43 Hs.161620 EST 27 

105829 AA398290 Hs.21965 ESTs 269 

127749 AI251757 Hs.145234 ESTs 269 

128428 AI185718 Hs. 143900 ESTs 2.69 

108409 AA075578 zm88h3.s1 Stratagene ovarian cancer (#93 269 

114739 AA134923 Hs.103633 ESTs; WeaWysimiar to predicted using 268 

128821 D87002 Hs.135 mulfipte UniGene matches 268 

107412 W26105 Hs.8961 ESTs 268 

117012 H85893 Hs.194387 ESTs; WeaWy simflax to ffil ALU SUBFAMI 268 

135262 AA416551 H&9732 ESTs 268 

105367 AA236397 HsJ20304 ESTs 268 

134771 L13939 Hs.89576 adaptor-rofated protein complex 1; beta 2.68 

105036 AA128617 Hsl25549 ESTs 268 

125093 792930 Hs. 186750 ESTs 268 

119340 761899 Hs.90677 ESTs; Highly similar to CGI-82 protein [ 267 

132603 H62900 Hs^3066 hsp7(Wntefacfing protein 267 

113733 798386 Hs.184548 ESTs 267 

123564 AA608902 Hs.112612 ESTs 266 

116059 AA454165 Hs.53455 ESTs 2£6 

125803 R79373 Hs^9852 ESTs 266 

123012 AA479962 Hs.139636 EST 266 

106080 AA418046 Hs.35124 ESTs 266 

128809 759668 Hs.102267 lysyl oxidase 266 

104354 H08988 Hs.113759 ESTs 266 

107068 AA609028 Hs^032 ESTs 265 

101418 M17754 rte.1276 BN51 (BHK21) temperature sensffivity com 265 

135157 AA460138 Hs.95582 SRY (sex-determining region >>box 20 265 
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123312 AA43S258 Hs5S6Q1 ESTs 255 

130034 CC0350 Hs.14454 c hr omoso me 2 open reading Samel 255 

103897 AA248870 Hs55Q58 ESTs 255 

117771 N47961 Hs.46794 ESTs 255 

109980 H09529 Hs3693 DKFZP586J0917 protein 254 

1219% AM29S53 Hs58616 EST 254 

114233 Z39BS2 Hs^7457 ESTs 254 

129594 R70379 Hs. 115396 Human ^nr^lgD (to gene; (>region; 263 

102319 U34587 Ks.8678 oorScotopbrelea^honixiimrecGzAv- 253 

111700 R22212 KbJ23361 ESTs 253 

127385 AA001S28 Hs.74335 teat shock 90H3 proteh 1; beta 253 

104205 AA496240 Hs.17270 DKFZF434C211 protein 253 

124559 N86223 Hs. 135328 ESTs; WeaMy similar to Ml ALU SUBFAM) 253 

108351 AA442772 Hs.191987 ESTs; Weakly sirn2ar to llfl AW SIBFAMI 253 

121903 AA427605 HsJ258742 irryosto^mrAtg prote&i C; cardiac 252 

116442 AAS2Q310 Hs.184343 ESTs; Weatysirrto to K1AA0535 pro^ 252 

127041 F06090 HSC(MG031ixjnnaD2ed&fent brain cDNAH 262 

132880 103049 Hs58435 FYN-bkufing protein (FYB-12V130) 252 

131591 L22454 Ks.180069 nuclear raspatory factor 1 2.61 

118118 N56901 Hs.47995 ESTs 251 

134809 X52611 Hs.18387 transa^floneactorAP-2 alpha (acfivat 251 

117706 N45091 Hs.46472 ESTs 251 

127468 AA312179 Hs.1788f7 ESTs; WeaWysim2ar to C6J-82 protein [ 251 

114891 AA235984 Ks57489 ESTs 2.6 

116426 AA609668 HsJ1657 ESTs Z6 

132589 AA432197 Hs5260 ESTs; Weakly simflar to CGJ-03 protein [ 25 

128410 AM52788 zx39g11 s\ SoaresJWaUetusJfljaiFB 9» 25 

108081 AA418394 H&25354 ESTs 26 

129919 R02003 Hs.191208 ESTs; Weakly similar to weak simSarSy 259 

124672 R0O307 Hs.188504 ESTs 259 

122758 M459013 Hs59742 X*ay repair conplemem^ defecfive rep 259 

1256% AA040118 Hs.78637 rteutraJ sprtkigomye^^ 259 

13C052 J00220 Hs.145288 Human tg acfive eps2on1 5 UT; V-DJ re 259 

134878 U28055 Hs^50826 rrmophagestaiiaimg; pseudogene9 259 

131908 L05624 Ha54tf mgogerwcfivated proton kinase kinase 259 

126470 AA843339 Hs.193163 ESTs; Weakly amllar to Cd-52 protein [ 259 

132353 M31651 Hs.46319 sex homione-btodlng globulin 258 

119588 W44559 Hs.142525 ESTs ^ 258 

131757 D17532 Hs516 DEAD/H (Asp-Gtu-Ala-Asp/His) box polypep 258 

118114 N56875 Hs. 143212 cystafln F (leukocystatin) 258 

128200 AE79952 Hs.158037 ESTs; Weakly slmQar to transcription re 258 

131208 C14586 H&24220 Homo sapiens mRNA; cDNA DKFZp566M051 (fr 256 

124721 R11131 Hs.154966 ESTs 257 

108706 AA121820 Homo sapiens mRNA for K1AA0842 protein; 257 

118831 N79592 Ha50838 ESTs 257 

115708 AA412212 Hs.44033 ESTs 257 

107233 059322 HsfflSS ESTs 257 

129559 AA234945 Hs.11360 ESTs 257 

126953 AA743849 Hs.127286 ESTs 256 

108165 AA055221 Hs.63168 ESTs 256 

104069 AA401547 Hs.172694 ESTs 256 

112146 R46512 H&25374 ESTs 256 

108384 AA074891 Hs.124917 ESTs; Highly similar to K1AA0838 protein 256 

131779 R49047 Hs.179779 ribosomaJ protein L37 256 

111829 R36070 Hs^5079 EST 255 

103424 X97267 Hs.155975 protein tyrosfoe phosphatase; receptort 255 

100133 D13118 Hs50S86 ATP synthase; H+ transporting; mtochond 255 

130208 AA620556 Hs.15250 rjeraxisomal D3J)2-enoytCoA isomerase 255 

124649 N92593 Hs.102907 ESTs 255 

106511 AA452865 Hsi06713 UDP-GatbetaGlcNAc bsta 1 ;4- gafcctosytt 255 

128467 AA176446 Hs.160428 ESTs; Weakfy sfcnfer to hypothe&al 43. 265 

113524 T90072 Hs.15060 ESTs 255 

107821 AA020991 Hs.172855 ESTs 255 

111900 R39044 - HsJ25318 Homo sapiens done JB194 mRNA sequence 254 

1099G8 H05255 Hs^03237 EST 254 

132069 D87454 Hs.192966 WAA0265 protein 254 

130660 T95262 Hs.17538 ESTs 254 

112983 T23443 HsJW ESTs 254 

128279 K08885 yt88b08Jl Scares infant braki 1NB Homo 254 

106415 AA447994 Hs^9188 ESTs 253 

116741 H03268 Hs.181746 EST 253 

103148 XB63S2 Hs^994 PCTAIRE protein kinase 3 253 

132336 AA342422 Hs.45073 ESTs 253 

129484 R92488 K5.111S89 ESTs 253 

110169 H19696 Hs51612 ESTs; Moceraiery sfmDarto CAGH4 [Usap 253 

116880 H68380 Hs.144174 EST 253 
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248 
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246 
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248 
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247 
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247 


126154 
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ESTs; Moderately simifarto HAAQ563 pro 
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EST 


247 


122274 


AA43704 


Hs.184456 


ESTs; Weakly similar to HI! ALU SUBFAM1 


246 


129600 


M78980 
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ESTs; Moderately sMarto unknown fis 


246 
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ESTs 
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131185 M25753 Hs23360 cycfcBI 2.48 

121451 AA4110G8 HaS80S5 EST 2.46 

104328 D81932 HUM424C5B Hu fetal Oram (TF^aara) H s 2.48 

12G543 AA723810 KSJ69517 ESTs; Highly stato to dffieraifiaflye 2.45 

123600 AAS09106 Hs.1 12644 ESTs 245 

131020 AA41175B Ks2B94 ESTs; WeaMy dmBar to irisato pjnelano 2.45 

134191 W289Q2 HsJ979 WAA0736geiie product 2.45 

12)446 X79510 Hs.155693 prot^^roslne^iosphatasacnQiHnqjt 2.45 

131613 R88228 H&29595 JfcHproiem 2.45 

11B8S4 N89570 Hs.42148 ESTs; WeaWy sim9ar to SifP) [Djnelanog 245 

104232 ABQQ2351 Hs.10537 WAA0353 profeto 245 

122604 AA453489 Hs39333 ESTs 245 

12)826 AA285064 Hs.104485 EST 245 

116655 PD3S66 H&68Q90 ESTs 244 

116267 AA485080 Hs 256539 ESTs 244 

114944 AA243172 H&87619 TH) protein 244 

127629 AA293279 HsJ9173 ESTs 244 

120350 AA211300 Hs.104166 ESTs 244 

103620 Z470S7 Hs.1ffi643 trariscnpfion etongafion factor B (SIB) 244 

131420 Z11737 Hs£ffi4 Bavin containing monoaxygenase 4 244 

131312 AA399228 H&25527 Bghi junction protein 3 {zona ocdudens 243 

122812 AA461044 Hs.142980 EST 243 
135100 AA398926 Hs251108 Honosaptontf^chronitsomel specffic243 

113464 TB6931 Hs.16295 ESTs 243 

100045 M11507 AFFX controt transferrin receptor 243 

128975 AA092129 Hs.107538 ESTs; Mocterately simferto /predlc&r 243 

103688 AA011479 Hs.154701 ESTs 243 
127331 F20188 HSPD05873HM3 Homo sapfenscDNA dona 05 

107337 T97111 Hs.191235 ESTs;WeaWyslmtertoYA324cptSxere 243 

122171 AA435750 H&98830 EST 243 

107601 AAQ04636 Hs^0223 ESTs 243 

119800 W73523 H&58314 ESTs 243 

104886 AA053348 Hs.144626 growfti cffiferenfefion factor 11 242 

122899 AA469960 Hs. 178420 ESTs; Highly sWar to WASP mteracflhg 242 

125933 AJ308037 H&8412D ESTs; WeaMy similar to nucteoporin p62 242 

121664 AA417291 H&97978 ESTs 242 
125450 AA377194 r&238909 ESTs; Weakly sfmflarto POLYPOSIS LOCUS 242 

114611 AA081374 Hs.108110 DKFZP547E21 10 protein 242 

111595 R11492 Hs.191225 ESTs 242 

111671 R19368 Hik229084 EST 242 

110687 H93005 Hs.177311 ESTs 242 

103019 X53414 Hs.144567 alanine-glyoxyiate aminotransferase (oxa 242 

119076 R36634 HsJ2355M ESTs 242 

130589 AA234308 Hs.16441 DKFZP434H204 proteh 242 

125975 AA495891 Hs.152290 ESTs; Highly similar to PACAP type-3/VIP 242 

106380 AA446188 Hs.16614 ESTs 241 

121965 AA429652 Hs.1 04901 EST 241 

121604 AA416788 H&98259 EST 241 

100885 HG4490HT4876 Pnjflne-Rfch Profetn Pib4. Altete 241 

117807 N48701 Hs.46523 EST 241 

119840 W79525 Hs£8586 ESTs 241 

102458 U48861 Ha54397 choBnagte receptor; nicotinic; bete po 241 

116152 AA460920 Hs£15683 ESTs; Moderator similar to Ufl ALU SUB 241 
126741 AA522512 Hs373 Homo sapiens mRNA; cDNA DKFZp566L2123 (f 

103381 X92715 Hs-3057 zinc finger protein 74 (Cos52) Z41 

124837 R55630 Hs^33602 KIAA0596 protem 241 

129322 AA437153 Hs.1 10407 ESTs; WeaWysNbr to coded for by C. 24 

129291 AA281330 Hs.110099 core-binding factor; nirf domain; alpha 24 

124789 R43603 Hs.78110 ESTs; WeaWy simfer to F17A9.2 [Cefega 24 ' 

133253 Y00970 Hs.183088 acrosin 2.4 

116990 N94447 H&55047 EST 24 

134897 R71427 H&9081 ptenytalanyURNA synthetase beta^subuni 24 

116572 D45654 Hs.65582 DKFZP58SC1 324 protein 24 

104294 D14539 H&234774 niyetoloViyniphoIdcrm 24 

118764 N74440 Hs^05264 ESTs 24 
117437 N27645 yw5e3.s1 Weizmarm Ofectofy Epfthefium H 

3 , stoi3artocomah3L1.13L1rep^ • 24 

111651 R16733 Hs2W99 ESTs Z39 

109583 F02322 H&26135 ESTs 239 

12969 R94247 Hs.193879 ESTs 239 

130647 AA457216 HsJM4190 WBffeulon enhancer bmdlng factor 1 239 

113706 T97467 Hs.18065 ESTs 239 

133469 L03785 Hs.170482 myosin; Bght polypeptide 5; regufatoiy 239 

118256 N62837 Hs.48647 fcnmunoglclHiIIn-Sse Iransaipt7 239 

121656 AA417243 HsS©12 ESTs 239 

126530 AI422841 Hs.180086 ESTs 239 
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123708 AA6Q9S48 Hs-207767 EST 239 

107875 AA0253O8 Hsj61182 ESTs 239 

111711 R22891 Hs.7093 ESTs 239 

131405 U79255 H&26468 amytoid beta (A4) precursor protelivttnd 239 

127454 AA502957 Hs.153590 ESTs 239 

132341 AA448419 Hs.45209 ESTs 238 

133673 D87673 Hs.75486 teat shock transcription factor 4 238 

113213 T58607 ya94a02s1 Stratagene placenta (£937225) 238 

1CS230 AM29356 Hs.12047 ESTs 238 

116692 RB261 Hs.66103 ESTs 238 

126197 AA172284 Hs.103657 ESTs; WeaHy sfrnfer to CH-TOG PROTEIN [ 238 

115966 AA446866 Hs.71371 ESTs 238 

132636 U65785 Hs.5417 oxygen regulated protein (150kD) 238 

109965 H09077 Hs.30895 EST 238 

130203 L14754 Hs.1521 immunoglobulin mu bfndhg protein 2 2.38 

131332 R50487 Hs35717 ESTs 238 

119105 R42357 Hs.91453 ESTs 237 

129253 W69316 Hs.109778 ESTs; Weakly sinter to stater to beta- 237 

113602 T92558 Hs.17036 ESTs 237 

118102 N55272 Hs.145798 ESTs 237 

100734 HG3432-HT3620 Rbrobbst Growth Factor RecejrfDrK-SanvAlLSpfioe^K-Samra 237 

111533 R08548 Hs.251651 EST 237 

130813 U12259 Hs.198 paired box gene 3 {Waardertag syndrome 237 

119180 R80413 Hs.92520 ESTs 237 

109335 AA211443 Ks.86492 ESTs 237 

107386 U97698 Hs.159593 mucin 6; gastric 236 

122486 AA448328 Ks.115527 ESTs 236 

112997 T23548 Hs. 167467 ESTs 236 

109674 FD9051 Hs^1837 ESTs; WeaWy stater to WAA09Z7 protein 236 

128868 AA423827 Hs. 106730 hypoffisfcal protein 236 

127027 R17261 yg12g07.r1 Scares infant brain 1NIB H sa 236 

123099 AA485931 Hs.79 aminoacyfase 1 236 

115716 AA416767 Hs.43498 ESTs; WeaWy s&nflartoORF YKL201c[SjC 236 

130830 D86982 Hs*0Q60 WAAD229 proteh 2% 

109051 AA159920 Hs.72322 ESTs 236 

12)181 R39552 Hs.151608 Homosaplens done 23622 mRNA sequence 236 

131114 R46233 1&23107 ESTs 236 

123589 AA609047 Hs.168922 ESTs 236 

12)872 U03891 phortnOn (sirnferto apo^oprotaln B m 236 

131962 H78550 K&2780 jm D proto-oncogene 236 

130502 MS5QS7 Hs.1583 neufrophS cytosofc factor 1 (47kD; chr 236 

121785 AA423883 Hs. 142442 ESTs 2.35 

125405 T97171 Hs.121570 ESTs 235 

103682 AA000993 ESTs 235 

125649 T77395 Hs.194816 stomafirvAe prototo 1 235 

115452 AA285019 Hs.55263 ESTs; Highly sfcniar to mftoctondrta) dl 235 

129338 T56800 Hs.47274 Homo sapiens mRNA; cDNA DKFZp564B176 (fr 235 

103105 AA421268 Hs.149443 putative tumor suppressor 235 

134770 R72079 HsJ9575 CD79B arrbgen (knrnimcfltobuSn-associatEd 235 

119422 T99496 Hs^29598 EST 235 

109869 H02849 Hs30345 EST 235 

134314 AA263032 Hs31634 ATP synthase; H+ transporting; rrflochond 235 

114989 AA251Q97 Hs. 1891 19 ESTs 235 

122619 AA453755 Hs.191515 ESTs 235 

133129 AA428580 Ks.65551 ESTs 235 

1284© AA416762 Hs.10Q221 mjdearreceptixsubfen%1;groiraH;m 235 

115636 AA402715 Hs.58369 ESTs 235 

130836 J05068 HsJ2012 transcobafemln I (vitamin B12 binding pr 234 

132385 Y10256 Ks.47007 serine/jhreonsra ptoteWtfnase 234 

107776 AA018820 H^221147 ESTs 234 

109791 F10669 Hs.13228 DRE-antagonfet modulator; calsenffin 234 

124409 N33212 Hs.107197 ESTs 234 

131068 AA397916 Hs.22595 ESTs 234 

121079 AA398719 Hs.14169 ESTs; Weakly similar to CREBJrfncflng pro 234 

124662 N94340 Hs.171835 ESTs; WeaWy smfrto PUT PRE-MRNA SPUCI 234 

133820 M13686 Hs.177582 surfactant; pimonary^ssociated p^otr* 234 

129424 M55593 Hs.111301 rnatnxnnetanoprotBinas8 2(g8bfinase^ 234 

109066 AA161377 Hs.72404 EST 234 

100339 D63485 Hs.181359 WAA0151 gene product 234 

100809 HG3991-HT4261 Cpg r EnrtttedDna,CtoraE18 234 

120844 AA349417 Hs£6917 ESTs 233 

124927 R96146 Hs^21459 ESTs 233 

109779 F10527 Hs3353 Homo sapiens clone 24940 mRNA sequence 233 

101171 L16842 Hs.119251 ubiquinc+cytochronie c reductase core pr 233 

110805 N26904 Hs.24048 ESTs; WeaWy sfmter to FK506frapamycin- 233 

125440 AI090982 HS31895 ESTs 233 
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133159 AO0G0QS1 K&663 cysft: fibrosis traresnemb conductance re 233 

10183 AS1368 Hs.129753 sbhjtocsaSBrf^Bka&mdt^x^ 233 

125432 AA778565 Hs.142505 ESTs 233 

102774 U83303 Hs.164021 smafl IndudWe cytofosubfemfly B(CX 233 

1304® H50809 Ha 15760 ESTs; WeeWy similar to similar to Yeast 233 

126878 AI424759 H&238928 ESTs 233 

117338 mm Hs.43466 ESTs 232 

118862 N70877 Hs.13055 ESTs 232 

130354 AA416685 Ha.155001 UNC13 (C efegansVGte 232 

106760 AA477330 Hs.12293 ESTs 232 

124294 ^0573 Hs.102298 EST 232 

119428 W02129 H&55242 EST 232 

132829 Z40942 HsS383 ESTs 232 

1279® AA854181 Hs.143585 ESTs 232 
132728 AA293334 Hs5566 ESTs; Hlghty simflar to RAS-RELATED PR0T232 

121292 AA189116 H<l96168 ESTs 232 

107598 AA004528 Ha. 169444 ESTs 232 

128164 AW78174 Hs.144846 ESTs 232 

105753 AA299789 Hs.15277 ESTs 231 

131256 AA262340 Hs^4907 ccronin; ac&vtoicfing protesi;2B 231 

110891 N38863 H&234392 r^ateJetecflvalirgfectora^ 231 

116767 H13689 Hs32530 ESTs 231 

100545 HG2147-HT2217 Mucin 3, (ntesflraJ (GfcM55405) 231 

12254 W88995 Hs. 167641 ESTs; WeaWy similar to CI 5H9.5 [Oetega 231 

118387 N64579 yz51d1U1 Morton Fetal Cochlea Nsapien 231 

104335 D83847 Hs.183864 etostasa3B 231 

107464 W42944 Hs.171939 ESTs 231 

112304 R54798 Hs.26239 ESTs 231 

134313 AA136100 Hs.6673 triiucreofete repeat containing 15 231 

116322 AA490900 Hsl58643 ESTs; H&Wy similar to JWUOB IKsaptens 231 

111275 N70970 Hs35006 ESTs 231 

100109 AJ000480 Hs.143513 phosphoproteln regulated by mBogeruc pa 231 

109338 AA211717 Hs36507 ESTs 231 

134432 AA053022 Hs3312 ESTs 231 
129649 ADO00092 Hs.182628 Homo sapiens DNA from cftr 19p13.2 cosnid 

EKLF; GCDH; CRTC; and RAD23A genes; gen 

122623 AA453990 Hs.99248 ESTs 231 

112070 R43S76 Hs^36310 EST 231 

127683 AA668123 Hs. 134170 ESTs 231 

104920 AA057620 Hs30807 ESTs; Highly simiar to dJ18601.1 [H.sap 231 

1060S4 AA417373 Hs.15898 ESTs 231 

106782 AA478487 ESTs 231 

126709 AAQ28159 Hs.47234 ESTs 23 

105129 AA158386 Hs.186476 ESTs 23 

105719 AA291644 Hs36793 ESTs 23 

121698 AA4183S3 Hs. 10351 KIAA0308 protein 23 

119069 R27619 Hs^31046 EST 23 

130388 U72515 Hs.189583 pute^protBfflSfirtotor«ssy(Droso 23 

103444 X98801 Hs.74617 dynactin 1 (p150; Giued (DrosqMa)hom 23 
114604 AA076128 zm18g4.s1 Stratagene pancreas (§93728) H 

3*simBartoSW^1AJflJMANP39274SRI23 

103878 AA227635 Hs^02588 ESTs 23 

105828 AA3S8276 Hs.11962 ESTs 23 

119778 W72920 Hs38244 ESTs 23 

120401 AA234309 Hs.193011 ESTs 23 

116290 AA488691 H&57969 ptenytabnbe-KNA synfoetase 23 

130479 R44163 Hs.12457 Homo sapiens clone 23770 mRNA sequence 23 

104253 ARKB8 72 Hs.152944 loss of heterozygosity; 11; chromoBcmal 229 

132615 K66367 Hs£3358 ESTs; WeaWy similar to HfJ ALU SUBFArVB 229 

121954 AA429593 Hs£8587 ESTs 229 

101336 L49168 Ks.75678 FBJ murine osteosarcoma vira] oncogene h 229 

127247 AA313802 Hs£289 growth factor receptor-bound protein 2 229 

117300 N22565 Hs.43212 ESTs 229 

122229 AA43619& Hs. 103302 ESTs 229 

125105 T95768 Hs.189760 ESTs 229 

12BC03 R16100 Hs.166476 ESTs 229 

131279 AA08985 3 Hs25197 STIP1 tomotopjandUtoccmtainii^pRrt 229 

133838 M97796 Hs.180919 mhibfcrof DNAbMirQ2;c^minantneg 229 

111837 R36447 H&24453 ESTs ' 229 

111435 RO1620 Hs.19198 ESTs 229 

123613 AA60915B Hs.112656 EST 229 

133560 AA256365 Hs.7466 protein expressed in myroid 229 

122896 AA468952 Hs37899 ESTs; WeaJcry s&nfiar to da!2; fen343; C 229 

113378 T80627 Hs.14757 ESTs 229 

127174 AA2932D4 Hs. 139352 ESTs 229 

120153 Z39582 H&65777 EST 229 
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112741 R93080 H&35035 ESTs 128 

132152 AA044784 Hs.4105 Homo sapiens mRNA; cONA DKFZp585A0618 (f 228 

1097% F1Q685 Hs25031 ESTs 228 

113776 W04657 Hs24248 ESTs 228 

102934 X13451 Hu mRNA for lymphocyte DneageHstrcted 228 

126168 AA322034 EST24690 Cerebellum 11 Homo sapiens cDNA228 

126363 N947C8 Human Chromosome 16 BAC done C1T937SK-A 228 

101427 M19908 Hunian n^open»xiasegene,exons14 228 

132616 AA386264 Hs5337 Isoritiate dehydrogenase 2 (NADP+); mto 228 

105537 AA258813 Hs27160 ESTs 228 

1265Z7 AA548559 Hs.103853 ESTs 228 

115359 AA281936 HsJ8914 ESTs 228 

108474 AA079667 zm93d1.s1 Stratagene ovarian cncr (88372 228 

12)685 AA291066 Hs.105099 ESTs 228 

126171 AA704771 Hs.191942 ESTs 228 

112858 TQ2963 Hs.4454 ESTs 228 

121817 AA424826 Hs.98475 EST 228 

107895 AA025150 Hs.61384 ESTs 228 

131161 Z38223 H&23735 potassium voflage-o^ted channel subfaml 228 

135173 M72885 H&95910 Human G0S2protein gene; complete cds 227 

103182 X69819 H&99995 Werceflular adhesion mofecule 3 227 

113889 W72720 Hs.194347 ESTs 227 

128984 AA319615 Hs238030 secrelDiy carrier membrane protein 2 227 

101531 M29877 Hs576 fucosWase*alpha4.-1; tissue 227 

115916 AA436889 HSJ91910 ESTs 227 

129892 HS6850 Hs.89674 toBchyMIphosph^^ 227 

103035 X54871 Hs.776S0 RAB5B; member RAS oncogene family 227 

126479 T7B141 ESTs 227 

125778 R71976 Hs.161791 ESTs; WeaWy simitar to I1J1 ALU SUBFAMI 227 

108132 AA053586 H&63048 ESTs 227 

111017 N53965 Hs256327 ESTs 227 

127165 AA359719 Hs.127121 ESTs 227 

126446 A1421309 Hs.118926 DKFZP586KD919 protein 226 

107864 AA025061 Hs£1246 ESTs 226 

122277 AA437133 Hs38936 ESTs 226 

115604 AA400378 Hs.49391 ESTs 226 

105061 AA134824 Hs.4865 ESTs 226 

118549 N68163 Ks.49455 EST 226 

110509 H56493 Hs.61960 ESTs; Moderately similar to HYPOTHETICAL 125 

114088 Z38280 Hs26971 Human Chromosome 16 BAC done CTT987SK-2 226 

103225 X74837 Hs2750 mannosidase; alpha; class 1A; member 1 226 

125842 AA746654 Ha5181 pnjfiferaftwvassocfated 2G4; 38kD 226 

104538 R25069 HS.1758J1 ESTs 226 

130304 U09368 Hs.154205 zinc finger protein 140 (done pHZ39) 226 

120660 AA290743 HsS7242 ESTs 226 

124062 H00440 Hs.144524 ESTs; WeaWy similar to signal transduce 226 

103289 X80915 Hs.1573 growth altterenfiaflon factor 5 (cartfe 226 

109286 AA197273 Hs.191324 ESTs 226 

128555 U6Z739 Hs.101408 branched chain amlnotransfefase 2; mtoc 226 

129439 AA171694 Hs. 111461 cenitopiasmin (fenuxkiase) 226 

109221 AA192755 Ha85840 ESTs; WeaWy sim3ar to stac [Haptens] 226 

109906 H05084 Hs28077 ESTs; Highly shnflar to GDP-mannose pyro 226 

130540 U35234 Hs.159534 protein tyrosine ptwsphatase; receptort 226 

122870 AA465158 Hs.192861 SpW traiiscnpfion factor (SpMffUl r 226 

120219 Z41124 H&66045 EST 226 

128021 AJ001138 Hs.78223 N^cytaniiiwacyH^ 226 

121732 AA421047 H&98330 ESTs 226 

107817 AAQ20781 Ha60847 ESTs 225 

101069 L02648 Hs44232 transcobalamtn B; macrocytic anemte 225 

103065 X58399 H&81221 Human L2-9 trai^ptof unrearranged Im 225 

118019 N52585 Hs.47517 ESTs 225 

122220 AA436011 Hs.98187 ESTs 225 

109161 AA179392 Hs.73601 EST 225 

128699 KD3207 Hs.103972 proCne^ch protein BstNI subfamily 4 225 

101914 S71824 Hs.167988 neural eel adhesion mofecutel 225 

102697 U74667 Hs.6364 Tat Interactive protein (6QkD) 225 

119939 W88753 HsB2407 ESTs 225 

127793 AI298S35 Hs3Q445 ESTs; WeaWy similar to transcription re 225 

104450 L77564 Hs.103978 senneTmreortine kinase 2ffl (sperrnrogeiiB 225 

133096 AA136042 Hs. 131 053 ESTs 225 

115416 AA283893 Hs203868 ESTs 225 

117056 H90322 Hs.41387 EST 225 

115598 AA400129 Hs£5735 ESTs 225 

121267 AA401397 Hs.165296 ESTs; Highly similar to teiBaeavJfce 225 

104778 AA02S397 Ks.11039 Homo sapiens done 24604 mRNA sequence 225 

110926 N48252 Hs.135287 ESTs 224 
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102795 U88OT Hs.198336 ATP4*dIng cassefie; sub-femfly A CABC1 224 

118643 N70324 Hs.43340 ESTs 224 

1032)4 X82240 Hs2484 T-c^ tatofe/Jynqtona 1A 224 

134814 Z48475 HsJBS771 gluxtftesefhaxDUiBsa^iegulalDfy pr 224 

125912 AA171719 Hs£233 etAaiyofctransfefionfnMcn factor 224 

134385 R32377 K&82240 syntarfn3A 224 

117224 N20300 H&218707 ESTs 224 

107169 AA621601 Hs.184446 ESTs;W^dn<artos^GTPtodIn 224 

133948 R&S916 Hs77813 spttngomyeft) phosphexfc&lBfa&e 1; add 224 

101426 M19483 Hs25 ATP synthase; H+transpciftrc rrfochond 224 

119922 W8619B Hs.177384 ESTs 224 

123361 AA504810 Hs. 139649 EST 224 

123915 AA621298 Hs.112967 ESTs 224 

123540 AA6C3792 Hs.112591 EST 224 

124978 T40560 Hs221759 ESTs 224 

102354 U38268 Human qtochnsnebpsoidogene.paj&lc 224 

124198 H53099 Hs.198271 NADH dehydrogenase (ubiquinone) 1 a?pte 224 

102160 U18235 Hs.121561 ATP-teuOng cassette; sd>fernily A {ABC1 224 

107520 X76091 Hs.1 00007 regular factor X; 2 fmfluencesHUc 224 

131589 1)52100 Hs29191 epffiteHaJ rnembrara protein 2 224 

126633 AA206993 Hs.154145 guanine raid binding protein (G protein) 223 

130887 AA258379 Hs.155986 angloterdn receptor-He 2 223 

119894 W84670 H&58518 EST 223 

124544 N63837 Hs.40500 similar to S.cerevis!aeRER1 223 

103104 X61587 K&75082 rashomobg gene famfa member GfrhoG 223 

110119 H17306 Hs.177229 ESTs 223 

131411 AA464043 Hs26506 ESTs; Weakfy similar to NY-REN-45 anflga 223 

102346 U37359 Hs227297 rr^o8crecorrdjirafiQn(S.cerevBiae)11 223 

106003 AA411167 H&8734 ESTs; Moderately sMbr to QQ ALU CLA 223 

122564 AA452251 Hs^8669 ESTs 223 

133688 U42031 Hs.7557 FK506«ndfog protein 5 223 

132096 AA131410 H&3364 Homo sapiens clone 24877 mRNA sequence 223 

110038 H11746 rte31097 ESTs 223 

123788 AA62D293 Hs.1 12853 ESTs 223 

135070 X99350 H&93974 forkhead box J1 223 

104908 AA055841 Hs.154396 ESTs 222 

128674 AA025001 Hs.169452 ESTs 222 

100810 HG3S92HT4262 Cpg-Enrfcned Dna, Clone E35 222 

120065 W93579 Hs£9478 EST 222 

122775 AA459692 Hs.112143 ESTs 222 

125443 H71482 Hs.177592 rfcoscmal protein; targe; PI 222 

118617 N69666 Hs.183413 ESTs; Moderateiy similar to Iffl ALU SUB 222 

128001 AI167814 Hs.166664 ESTs 222 

128160 AI279080 Hs.143971 ESTs; Moderately similar to !BI ALU CLA 222 

1060)8 AA456644 H&27115 ESTs 222 

103485 Y08409 Hs248415 «iyroWhcrmcnerBspcnswSPOT14(rafl 222 

135008 AA173423 Ha.92913 ESTs; WeaHy simfer to R07G3.8 fCetega 222 

110122 H17333 Hs.1 59837 EST 222 

128397 AI393421 Hs.14032 ESTs 222 

110231 H24359 Hs2B733 ESTs 222 

123188 AA489092 Hs.177726 ESTs 222 

131903 AA481723 HsJ436 oetefedinaa* cancer (majse; hornotog) 222 

122649 AA454616 Hs50336 ATPase; H+ transporting; lysosomal (vacu 222 

133090 AA448228 Hs£468 ESTs 222 

108002 AAQ37664 H&55067 ESTs; WeaHy similar to TU7F121 gene pr 222 

133120 X64559 Hs.65424 tetranectin (pfasrrmogen^imfing protein 221 

114263 Z40073 Hs.6045 ESTs 221 

125518 R20148 Hs.1 93851 ESTs 221 

128613 U78551 Hs.1 02482 Homo sapfens gallbladder mucin MUC5B mRN 221 

102773 U83192 Hs23731 decs; ^eprasophDa) bornD!og4 221 

119526 W38049 Accession not fisted biGenbank 221 

126844 AA299325 EST11903 Uterusturncf I Hcrno sapiens cDN 221 

105860 AA399251 Hs.180933 ESTs; WeafystaBar to mefoyWfcG bind! 221 

126957 AA733145 Hs.194560 ESTs 221 

108959 AA150107 Hs*1810 ESTs 22 

131663 AA423926 HsJ0318 ESTs 22 

127468 HQ2941 HsJ888 ESTs 22 

104483 N42776 Hs.146233 ESTs 22 

123848 AA620773 Hs221996 ESTs 22 

101623 N65905 Hs.75342 n^erayine2;NAD(4<iepenc^n^ 22 

120872 AA357993 Hs3S99S ESTs 22 

135033 AA173241 Hs53454 ESTs 22 

122286 AA437259 Hs.104944 EST 22 

114862 AA235174 Hs£0250 ESTs 22 

10Q255 D38047 Hs.78466 proteasome (prosome; maoDpaln) 26S subu 22 

103063 X58234 Hs.123178 transiccaseofimerni3Dchrjr^ 22 
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132777 RS898 H&56663 ESTs 22 

133082 AA457129 Hs£455 RuvB(Eco0hon»togHkB2 22 

27529 AA558980 Hs.191750 ESTs 22 

114602 AAD75642 Hs.103594 deisled hmal^nantbram tumors 1 22 

20722 AA293435 H&97277 ESTs 22 

102675 U72S12 HunmB-ceO receptor associated protein 22 

128551 H09058 Hs.237323 M^cetyigIucosaniiiiQi)hos^ mutase; DK 22 

112120 R43001 Hs£2298 EST 22 

23625 AA609216 Hs.1 12666 EST 22 

120315 AA1S4266 Hs.178333 ESTs 22 

122081 AA431992 Hs.104920 ESTs 2.19 

101798 M85220 Accession not feted In Genbank 2.19 

111501 R07444 Hs.163118 ESTs 2.19 

I32832 D63482 Hs57734 KJAA0148 gene product 2.19 

00544 HG2147-HT2217 Mudn 3, Intestinal (Gb:M55405) 119 

I06835 AA482077 H&33713 ESTs; WeaWy similar to hypothetical pro 219 

132934 AA076145 Hs.61053 ESTs 2.19 

108762 AA127515 Hs.71787 ESTs; Highly similar to 30S nbosomal pr 2.19 

1 201 64 Z39733 Hs.158159 FAT tumor suppressor (Drosophfla) bomolo 2.19 

35395 L08096 HsJ9899 tumor necrosis factor (Ugand) superfemi 219 

101717 M69013 Hs.1686 guanine nucleotide binding protein (G pr 2.19 

121172 AA400013 Hs37750 EST 218 

114861 AA235123 Hs.40719 ESTs 218 

120851 AA349662 Hs.174248 ESTs 218 

121083 AA398736 Hs£7653 EST 118 

107171 AA621624 H&28088 Homo sapiens done 24515 mRNA sequence 218 

I28754 D31446 Hs.1 0488 Brealo^dustBriegtonprote&i; uteri 218 
100149 D13897 Hs.1 69249 pep&leYY '2.18 

132405 AA323787 Hs.4770 WAA1068 protein 2.18 
114666 AA1 12274 zm27g&s1 Stratagene pancreas (#93728) H 

etemen!^xntalnsetemertLTR8rBpefflv 2.18 

1 27008 AA223879 zr10g05.r1 Stratagene NT2 neuronal precu 218 

(10373 K42896 Ks29438 ESTs 2.18 

119354 T66942 Hs.100651 goJgl SNAP receptor complex member 2 218 

130115 M31627 Hs.149923 X-box binding protein 1 218 

130514 AA161Q85 Ha 15871 ESTs; WeaWy similar to add phosphatase 218 

I28848 HD8077 H&217179 ESTs; WeaWy sfrnilar to T27A1.5 [Celega 218 

110161 H19312 H&28096 ESTs 218 

I32367 X82224 H&46634 cysteMe conjugate-beta ryase; cytoplasm 218 

I25882 H45538 Hs.101448 metastasis associated 1 217 

113837 W57698 H&8888 ESTs 217 

I06376 AA444C04 Hs.6084 ESTs 217 

113755 T99075 Hs.18570 ESTs 217 

I07525 X91817 Hs.102866 taansfcetolase^to 1 217 

119207 R93188 H&84298 CD74 antigen (Invar polypept of majhist 217 

131862 AA236&5 3i)hosprrogrycefate Dehydrogenase 217 

115514 AA297739 Hs£5609 ESTs; WeaWy similar to ISOLEUCYL-TRNA S2.17 

112290 R53940 fe26016 ESTs 217 
126136 H83353 yv82f02r1 Scares melanocyte 7mM Homo 217 

121574 AM12712 Hs.1 19325 Hunfing&^^ntBiacfing protein A 217 

118530 N67900 Hs.1 18446 ESTs 216 

I32327 AA2Q3285 Hs.44892 ESTs; WeaWy amflar to dJ733D1 5.1 [H^ 216 

100564 HG2239+1T2324 Potassium Channel Protein (Gb21 1585) 216 

IS376 AAQ22622 Hs.13543 ESTs; Weakly similar to hypothetical pro 216 
135317 XB6012 Hs.98602 Human DNA sequence from fntron 22 ofthe 

9.5kb repeated region; int22h-1; invorv 216 

114973 AA250845 HsJ7762 ESTs 216 

107559 AA001504 Hs38860 ESTs 216 

111014 N53787 Hs.191117 ESTs 216 

101250 L34060 Hs.79133 cadherin8 216 

110697 H93721 Ha2Q798 ESTs 216 

126843 AA450166 Hs22641 ESTs; Moderately similar to predicted pr 216 

1 03272 AA063816 Hs.43773 ESTs 216 

125012 T66935 Hs.104859 ESTs 216 

111639 R16101 Hs.140834 EST 215 

123157 AA488443 Hs.100428 DKF2P584A063 protein 215 
1 02315 U34252 Hs3533 aldehyde dehydrogenase 9 (gamma^mirotJutZ15 

131897 AA287623 Hs3426 GTPase; human homotog of E. coQ esserrti 215 
121528 AA412253 HsJ238909 ESTs; WeaWy sunBarto POLYPOSIS LOCUS 215 

122808 AA460707 Hs.106397 ESTs 215 

125727 HC0958 Hs.181641 ESTs 215 

133279 AA069571 Hs.6957 Homo sapiens done 24616 mRNA sequence Z15 

103219 X74570 Hs.75268 siaryitransfefase4C (beta^alactosidase 215 

120881 AA362144 Ha 104601 EST 215 

134080 D42039 Hs.78871 WAA0081 protein 215 

106598 AA457140 Hs.1 1411 DKF2P566O084 protein 215 
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12576 R662D8 yi3Qh03j1 Soaies pbcenta Kh2HP H ^pia 

contains AIu repefifive dennn^oonlaki 2.15 

125727 AA037230 Hs.135084 (9^C(aroyUdai^opaSiyandcoeb 215 

10149) M2563 Hs.123107 teEDtreto 1; raHl/panarrasfeafiyaiy 115 

13708 AM17181 Hs.120858 ESTs 214 

100627 HGZ702-HT2793 SenWTItreonine Kinase (GfcZ25424) 214 

121703 AA418671 Hs.104807 ESTs 214 
106BG9 AA479704 H&22Q324 HuranDNAseqfmncfane283E3OTchr1p36 

Female Reproductive tract MIFR1;-2;f^ 214 
13525 F03573 Hs. 112305 Homo sapiens done 24955 mRNA sequence; 214 

100478 HG1QS7-HT1067 Mucm (GfcM224CS) 214 

118533 N69020 H&207689 EST 214 
114047 W944Z7 Ksl3807 ESTs; WeaHy similar to PBOSPHOLBWMAN PR 214 

128823 AA478207 Hs.10832 ESTs; Moderately simfar to sex-detami&i 214 

100534 HG19aOHT2Q23 Tubufin.Be^ 214 

105757 AA321146 Hs^Q596 ESTs 214 

109617 F03192 Hs^b789 ESTs; WeaWy similar to dJ162H14.1 [H-sa 214 

121547 AA412448 Hs.104777 ESTs 214 

119420 TS3291 Ks.102484 ^utefliion8&4raiBferaseA3 214 
120274 AA177051 nc02a02s1 NCLCGAP _Pr3 Homo sapiens cDN 

rep^reetenentianlmtBetonentLTR 214 

132933 AA5987Q2 Hs.6101 bone morphogenefe protein 6 214 

133405 X07881 Ks.73031 prolwHfcn protein BsftD subfamily 3 214 

119811 W73922 Hs.49047 ESTs 214 
134536 AA457735 H&850 IMP (inosins monophosphate} derryrfiugenas 214 

105125 AA157799 Hs.6980 aidotato reductase femDy 7; member A2 214 

101398 M15881 Hs.1137 uromoc^ (un^mira^ 214 

132751 AA397901 H&55993 ESTs 213 

115777 AA424142 1&39384 pufafive seaeted Ggand homologous to f 213 

123193 AA489228 Hs.136956 ESTs 213 

116875 H67749 Hs.161022 EST 213 

107271 D6QS07 Hs34931 EST 213 

134551 R44839 HsJ526 H)efe-1£WacetyJg!ucosam 213 

113413 TB3739 Hs.186512 ESTs 213 

120522 AA258843 Hs258748 ESTs 213 

119965 W87738 H&59039 EST 213 

131283 AA101601 Hs.183986 herpesvirus entry mediator B (poiovfrus 213 

107347 U43628 Hs.102598 mucosa vascular address^ eel adhesion 213 

116490 C142S5 Hs.66450 ESTs 213 



100563 HG2239-HT2324 Potassium Channel Prote*i(Gb21 1585) 213 

110441 H50302 Hs.19845 ESTs; Highly similar to protein phosphat 213 

101035 J05158 Hs.73858 caitxaypepfidase N; polypeptide 2; 83kD 213 

132500 AA047297 Hs,50107 ESTs; Moderately sfcnflar to COO [H^apte 213 

129807 L34820 Hs^299 akfefiyde dehydrogenase 5 ^Oy; memo^ 213 



106250 AA430466 Ha28890 ESTs 213 

113569 791086 Hs.162070 EST 213 

122911 AA470087 Hs239725 ESTs 213 

107452 W2B988 H&250746 ESTs 212 

111824 R35661 Hs25006 EST 212 

132831 U53442 H&57732 mtogeivacfivated protein kinase 11 212 

110244 H26742 Ha25367 ESTs; Weakly similar to ALR [Rsaptens] 212 

128918 H85347 Hs.107164 spectrin; beta; naverymmcyfc 1 212 

133728 M10901 Hs.75772 rnidear recep tor subfamily 3; group C; m 212 

122476 AA448211 Hs.39164 ESTs 212 

132004 L37360 Ha37054 ephrin-A3 212 

113971 W86760 Hs220682 ESTs 212 

103386 X92972 K&80324 pfotebprwsphato6;cafeIyfcsub^ 212 

131120 AA443676 H&23133 ESTs; WeaWy sim3ar to akchd sulfotra 212 

102186 U20285 G protein pathway suppressor 1 212 

103694 AA018541 Hs.60580 zinc foger protein 212 

111995 R42333 Hs2Q893 ESTs 212 

124436 N39596 Hs,182SJ4 ESTs 212 

100306 D50495 Hs.80598 transcnp&n etongafion factor A (SII); 212 

103084 X59932 Hs.77793 *src tyrosine Irfnase 211 

115092 AA255903 Ks*0975 CD39-Gke4 211 

121579 AA416543 Hs.1 11981 ESTs 211 

127101 AG49351 H&118944 ESTs 211 

121195 AA4C0273 Hs£7791 ESTs 211 

112721 R91484 Hs3QS53 ESTs 211 

113253 T64207 Hs£5296 HUB asscdated franscrfpt-1 211 

120838 AA348887 H&9S907 ESTs 211 

114122 Z38582 Hs.12751 ESTs 211 

112635 R822S8 H&29497 ESTs 211 

103785 AAO95600 HsJ225647 ESTs 211 
128260 AA331445 EST35277 Embryo. 8 week I Homo sapiens c 211 

122987 AA479155 Hs.103364 ESTs 211 
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110374 H42983 K&227263 ESTs 211 

116595 D6C625 Hs.177656 calmodulin 1 (phosphorybse kinase; dett 211 

126117 H78617 yu26aC8j1 Scares fetal Tver spleen 1NF 211 

116810 D80448 Hs.45177 ESTs 211 

111430 R01248 Ks.19165 ESTs 211 

106700 AA463929 H&28701 ESTs 211 

120181 Z40121 Hs.65870 ESTs; WeaMy statoto Prc-WI<fiJTPase 21 

132545 AA147218 H&5105 ESTs 21 

105005 AA115253 HsJ28805 ESTs 21 

126702 U54602 Hs^785 keratin 17 21 

124096 H10060 Hs.101687 EST 21 

13272) ZB9881 H&5541 ATPase; Ca++ tnansporcng; ubJqufews 21 

121926 AA428559 Hs.104895 ESTs 21 

125734 M157445 1&227391 DKFZP547E1 010 protein 21 

122368 AA443963 Hs.104964 EST 21 

116310 H72014 Hs.161031 ESTs; WeaHy shnfer to SYNAPT0TAGM1N I 21 

113171 T54613 Hs5761 EST 21 

134629 U00951 rte.87150 Human done A9A2BR11 (CACfr/{GTG)n repea 21 

105712 AA2912S3 H&25219 ESTs 21 

106S31 AA495918 H&26714 ESTs 21 

114278 Z40424 HiZ7728 ESTs 21 

116615 D80666 Hs.45203 ESTs 209 

10D189 D21089 Ha.320 xeroderma pigmerrtosum; conpfernentafion g 209 

119500 W37694 H&55561 ESTs 209 

129605 S72493 Hs.115947 keratin 16 (focal non^jHemioly&palm 209 

133912 X62744 Hs.77522 major histocompatilDfty comptex; dass 209 

129638 N34942 Ha 11 782 ESTs 209 

106372 AA443941 Hs.4992 tumor suppressing subtransferabte candid 209 

101885 M98539 H&8272 prostaglandin D2 synthase (21 kD; brain) 209 

132749 AA235989 Hi55967 short stature homeobox 2 209 

135042 X91348 Hs.93522 putative noivecdlng transcript (DiGeoige 209 

109404 AA224594 Hs.86941 ESTs 209 

101333 U7738 Hs.80313 p53 fndudbte protein 209 

100114 D00596 Hs.82962 thymMytatB synthetase 209 

130538 T17D45 Hs.159492 spastic ataxia of Crraitevobc-Sagueriay (s 209 

125772 R83S03 Hs.78040 KDH. (Lys-Asp-G*hUu) erctoptasmlc refic 209 

132192 AA247569 Hs.4209 ESTs 209 

124697 R06273 Hs.186467 ESTs; Moderately simBar to till ALU SUB 209 

127694 A1247780 Hs. 11 7036 ESTs 208 

127895 AA772600 Hs.187998 ESTs; Weakly similar to ATP-btodlng cass 208 

121315 AA402883 Hs.82269 progestage(vassodaiedenA)metiiaJprote 

endometrial alpha-2-gbbulin; afcha ute 208 

11219) R46576 Hs.23239 ESTs 208 

105054 AA133584 Hs^6333 JM1 protein 208 

113151 T51620 HsS326 EST 208 

118783 N75285 Hs.50593 ESTs; Moderately sJmBarto cytoplasmic . 208 

1M AA249580 Hs39975 ESTs; Moderately similar to COO [Ksapte 208 

135160 U77643 Hs.95655 secreted and transmembrane 1 208 

107516 X60152 zinc finger protein 2 208 

126055 N28990 yx39g04 xl Scares melanocyte 2*HM Homo 208 

116982 H81933 Hs.40317 ESTs 208 

101756 M77235 Hs.169331 sotfium channel; voltagegated; type V; a 208 

116935 H75763 Hs£3468 ESTs 208 

118556 N68408 Hs.194637 Homo sapiens mRNA; cDNA DKFZp564D1 13 (fr 2.08 

129812 L07807 Hs.166161 dynamlnl 208 

121946 AA429411 Hs.104888 ESTs 208 

133843 AA489045 Hs.76691 Homo sapiens dona 25100 mRNA sequence; 208 

122170 AA435744 Hs.163913 ESTs 208 

122399 AA446449 H&231112 EST 208 

105775 AA348274 Hs.6664 ESTs 208 

123943 AA621553 Hs.112998 ESTs 208 

105771 AA347967 H&256267 rteurobtastorna RAS viral (v-ras) oncogene 208 

114454 AA021091 Hs^26208 ESTs ~ 208 

125802 R78652 Hs.151099 ESTs 208 

131556 AA442853 H&2869 cydn-dependent kinase 5; regulatory su 208 

118837 N79836 H&216338 ESTs 208 

107345 U26209 Hs,1Q2307 sohits carrier femSy 13 (soaTuiTHlspende 208 

131324 H58690 Hs25625 ESTs 208 

105233 AA216759 Hs.191132 ESTs 207 

112886 TQ3864 Hs.7436 putafive acyttransterase 207 

120252 AA169400 Hs.152701 DKFZP434F124 protein 207 

114867 AA235310 H&52899 ESTs; Moderately stmflar to IN ALU SUB 207 

106715 AA464955 Hs.126062 ESTs; WeaWy stmilar to EPEBWAL GROWTH 207 

125560 R51281 Hs.13692 ESTs; Highly sknfcr to PROTEIN TSG24 [M 207 

112270 R53021 Hs203358 ESTs 207 

134626 S821S8 Hs^709 caJdecrin (serum calctom decreasing fact 207 
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115723 AA417345 Hs-54846 ESTs 207 
123895 AA621192 Hs.112949 EST 207 
119906 W85818 ESTs: Moderate^ stater toB Alii SUB 207 

108559 AA085161 zrr12cfr\s1 Staiagene hNT neuron (§937233 

MAGE54728 J sirr3artoTftG1151228 G 207 

101245 133799 HsJ2Qa»7 procDflagen&qtdDpepBdasegitencBr 207 

100663 HG2915WT3059 Major WstoccjqH&ffity 207 

114178 Z39063 Hs.17930 HurmDNAseq firm clone 1033B10 on chr6p 

^GarT3(beta3-GafeclDsyfii3nsfefase) - 207 
12672 AA152281 Hs78601 uiDpofphyrfrogen dec atx t iyfase 207 
118052 N53360 Hs.165133 ESTs 207 

102387 U41163 H&229731 sohitecanfer family 6 (neurotransnflte 207 

127305 AA535148 Hs255277 ESTs 207 

101182 L19711 HsJ6111 dystroglycan 1 (dystrophirvassodatedgi 207 

131111 R33245 H&23076 ESTs;Weak^slndbrtoputafive[pjeieg 207 

112441 R63388 Hs28412 ESTs 206 

117796 N48571 Hs.46689 EST 2S& 

116099 AA456309 KsJ8831 regulator of Fas-ina^ajuptosb 206 

125559 AA307550 Hs.1 19571 coOagen; type III; alpha 1 (Ehter&Oanl 206 

135271 AA397763 Hs37582 ESTs 206 

100)83 AA418545 Hs31659 thyroid hcnnons receptor-associated prot 206 

133419 U67369 Hs73172 grcwft facta irxteperrientl 206 

127816 AA743646 Hs/l20SO4 ESTs 206 

127502 AA614422 Ha. 183502 ESTs 2GS 

129371 M10321 Hs.110802 von Wffietaandfector 206 
108417 AA075716 zm89e5.s1 Stralageneovanan cancer (§93 

CUUSTHR1N PRECURSOR (HUMAN);, mRNA sequ 206 

102837 U94585 Hs.13435 requiem; apoptosis response zinc fnger 206 

124226 K62396 Hs.190266 ESTs 206 

102254 U28131 Human HMGWchimerfc transcript mR^ 206 

128472 X87212 Hs.10029 cafiiepsrnC 206 

107545 Z82022 Hs36433 Q^tehyHaho^hatB (UDP^teeiylgtiicosam 206 

135311 M36089 HsJ8493 Xray repair cornptemaiftig defective rep 206 

121727 AA420973 Hs.104234 ESTs 208 

131846 U02619 Hs331 genarai transcrip&n factor IOC; polyp 206 

12)415 AA235810 Hs.182522 ESTs 206 

110529 K57686 Hs37486 ESTs 2.06 
104996 AA112307 Hs.105894 Homo sapiens mRNA; cDNA DKFZp434G231 (fir 206 

110351 H41222 Hs.196439 ESTs 206 

131261 AA223746 Hs.171776 mceito((inyoM(or4)-rtt^^ 1 206 

110585 H62223 Hs.133526 ESTs; Weakfy similar to Ilfl ALU SUBFAMI 206 

129420 AA234259 H&99816 ESTs Z06 
103796 AA1 12595 H&31146 Human DNA sequence from clone 1042K10on 

lyase (EC 4-122; Aoenybsucdnase; AS 

3). Contains ESTs; STSs; GS 206 

119782 W72982 Hi58262 ESTs 206 

108641 AA1 12)59 ATP synthase; B+ transpodng; mitocnond 206 

134875 U66672 Hs.180513 ATP-bindlng cassette; sub-tarnfly A (ABC1 206 

106832 AA482015 H&30114 ESTs; HigWy S8n3arto C8 [Fisapiens] 206 

109403 AA224413 Hs.86937 ESTs 208 

115485 AA2&7667 Hs.188804 ESTs 206 

102923 X12517 Hs.1063 small nuclear rrbonudeopratain pofypept 206 

123320 AA49S792 Hs.139572 EST 205 

111901 R39C66 Hs.17638 ESTs Z05 

106558 AA455111 Hs.1 82447 htdercgenec^rcrclearr^ 205 

126885 AA293052 Hs.10101 ESTs; WeaHysimfer to ceded for by C. 205 

113429 T85190 Hs.179808 ESTs 205 

102270 U30255 Hs.75888 ptosptogfuconate dehydrogenase 205 

103204 X72475 Hs.1 92989 ftsapkns mRNA for rearranged kj kappa I 205 

106666 AA461072 H&37916 ESTs 205 

100947 HG907-HT907 Mg44 205 

102578 U60666 Hs57693 tests speaSc teuefce rich repeat prot 205 

105827 AA398255 H&31520 ESTs 205 

122324 AA442830 H&98921 EST 205 

101025 J04823 Hs£1097 cytarhmmB c oriria^ sd bu nft VfH 205 

115861 AA431768 H&90259 ESTs; Weakty similar to alpha 1 (Rsapie 205 

108081 AAO45306 Hs.4396 ESTs 205 

133994 X74929 ' H&242463 keraSn8 205 

119131 R46700 Hs.129692 ESTs; Moderately swnilar to Ml ALU SUB 205 - 

129793 AA300151 Hs.126857 ESTs 205 

101653 M60284 Hs.161305 teriiyfcfjito receptor 2 205 

120300 AA191648 Hs.131476 ESTs 205 
105519 AA453415 Hs*763 HuDNAseo^eiratactona^WSOTchr 

7nymoc^MateCTX;fl»possfl^allB 205 

114291 Z40690 Hs.123666 Homo sapie n s mRNA M length insert cDN 205 
105747 AA293719 Ha3025f ESTs; WeaHy s&nOar to QiJCOSE-6-PHOSPH 204 
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125325 AA332944 Hs.8402 adenytefe cydase 3 104 

119978 W88623 Hs.59190 EST 2.04 

102449 1/48231 Hs.46348 braifykWn receptor B1 Z04 

101454 M21812 H&50889 myosin light chain 2 2L04 

116086 AM55904 HsJ6Q23 ESTs 2j04 

102297 1)32674 Hs, 198252 G proteb-ocupied receptor 9 2.04 

130889 D57622 H&20985 s!n3-assodated poiypepfide; 30kD 2.04 

100196 D21853 Hs.79768 K1AA01 11 gene product 104 

120967 AA398111 H&97503 ESTs 2.04 

105735 AA293096 H&32417 ESTs 104 

135031 R41604 H&9344 ESTs; WeaWy similar to ID! ALU SUBFAM) 104 

104882 AA052954 Ha29546 ESTs 104 

132619 AA404565 H&53447 ESTs; Moderately similar to Hhesto Hgh 2.04 

127993 AA847856 Hs.124565 ESTs 2.04 

116441 AA620299 H&91696 ESTs 2.04 

102272 U30610 Hs.41682 kffler cell fedto-Bce receptor subfem! 2.04 

119566 W38209 Accession not Hsted In Genbank 104 

116622 D81171 Hs.45208 ESTs; WeaHystoOar to collagen type VI 2.04 

127162 AA248620 Hs.1 66011 catenh (cadterfoassociated protein); d 104 

116870 H67146 H&38564 ESTs ISA 

115448 AA284845 Hs.165051 ESTs 2.04 

127231 AA434584 zw52c03j1 SoaiesJtotaLfeius_Kb2HF8_9w 104 

103457 X99728 Ksapiens NDUFV3 gene, exon 3 ISA 

134737 U00802 Hs.89434 drebrin 1 ISA 
117046 H89505 yu81tt.s1 Scares fetal fiver spleen 1 NFL 

to contains AIu repetitive element;, mR 2SA 

124579 N68345 Hs.127179 ESTs; WeaWy simEar to TBttTOCARClNOMA- 

112132 R45970 H&236349 EST 2.04 

132281 AA133300 Hs.43803 leutocyte^assodated preceptor 2 103 

103668 Z83741 Hs-248174 H2A histone ferrfly; member M 2.03 

113501 T89107 Hs.13262 ESTs 2.03 

125021 T70060 Hs.163918 ESTs 2.03 

115754 AA420998 Hs.178095 ESTs 2XO 

123405 AA521370 Hs.191708 ESTs 2.03 

102054 U07695 Hs.155227 EphB4 2.03 

115627 AA401910 Hs.1 19175 ESTs; WeaWyshto to ZINC FINGER PROT 103 

129252 AA234663 Hs.109773 ESTs 103 

103417 X96849 H^apiens 5* mRNA cf PECAM-1 molecule 103 

133721 U11863 Hs.75741 aritoride birring protein 1 (amhe orida 103 

114176 Z39059 Hs£7267 ESTs; WeaWy similar to tetraspan TM4SF 103 

123966 C14068 Hs.21808 ESTs; Moderately sMar to similar to N 103 

134236 D45371 H&80485 appose rra)Stabur)daiTtger«trarTscn>t1 103 

116381 AA598614 Hs.65394 ESTs 103 

103711 AA046737 Hs.102792 ESTs 103 

109316 AA206914 Hs.86322 EST 103 

123793 AA620343 Hs.1 12858 ESTs 103 

128462 M69238 Hs.1 66172 aryi hydrocarbon receptor nucfear transl 103 

117690 N40467 Hs.93834 ESTs 103 

113301 T67452 Hs.13104 EST 103 

134563 AA173430 Hs35335 Homo sapiens mRNA; cDNA DKFZp564D1462 (f 

108316 AA070160 zm6ms1 Stratagene najroepfthefium (#9 103 

135239 AA454599 Hs.19399 Homo sapiens chromosome 19; fosmld 395542.03 

120342 AA207105 Hs.45068 Homo sapiens mRNA; cDNA DKFZp434l143 (fir 

103493 Y08976 Hs£34759 Haptens mRNA for FEV protein 102 

114204 Z39259 Hs26096 ESTs 102 

125425 H62307 Hs.18575 ESTs; WeaWy similar to K1AA0246 [H.sari 102 

133027 AA402624 H&63236 synucteln; gamma (breast cancer-specrfic 102 

131323 H54036 Hs35S19 deaiTHJSsodaied protBln kinase 3 102 

121515 AA412133 Hs.104696 ESTs 102 

129780 AA291526 Hs.124699 ESTs 102 

131292 AF0O5039 Hs^OOSOO secretory canter membrane protein 3 102 

132973 AA035446 H&214361 ESTs 102 

103727 AA059415 Hs.6289 growth factor receptoMxxmd protein 2 102 

113174 T54659 Hs3779 ESTs 102 

120964 AA398085 Hs.142390 ESTs 102 

134303 AA457242 Hs.8141 etopostde-induced mRNA 102 

128118 T81623 H&21765 hypc*e8caJ protein of imtatown tuncfio 102 

121087 AA398751 H&97304 ESTs 102 

102806 U90306 Human troquols-dass rwrneoa^main protein 102 

103195 X70940 H&2642 eufcaryofc trareJafion eicngatbn factor 102 
126767 C17148 C17148 Ctontech human aorta po!yA+ mRNA 102 

105179 AA189083 Hs*21974 ESTs; Moo^rately sirnflar to mBOCT [Mmus 102 
116797 H40486 ynB7a08^1 Scares adult brain N2b5HB55Y 

3 > stoo]artocorito{nsA2UR3peS!h«e 102 

133268 AA099404 Hs.69307 ESTs 102 

123951 AA621721 Hs231130 EST 102 
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115463 AA286819 H&69485 ESTs; Weafystailar to shier to ofiier 202 

110803 K65776 Hs^22403 ESXs £02 

101234 L29277 Hs.142258 s^mdtensduoerandacfi^brofbans 202 

1212)8 AA400470 Hs57505 ESTs 202 

12S98 AA453465 Hs£9329 ESTs 202 

110668 H84882 Hs33791 ESTs; WeaHy staffer to K*Ct cofranspcrt Z02 

117137 H96670 Hs.42221 ESTs 202 

119389 T88826 H9L9Q973 ESTs 201 

102940 X13956 Hjl24S98 Human 12SRNA induced by poMr^poiyf 201 

100748 HG3517-HT3711 Wpha-1-Anfijypsh.ff End 201 

103012 X52638 HsJ39 &ftaphdnx*>^^ 201 

13Z755 AA603201 Hs.182635 ESTs 201 

1^)842 H39589 Hs^01S9 ESTs; HJgtysimiar to CG)^ protein [ 201 

1335S9 M64788 Hs.75151 RAP1; GTPaseacfivafng pntein 1 201 

117250 N21081 Hs.1 5299 HMKUndutibJe v 201 

115124 AA256666 Hs39156 ESTs 201 

128155 AA926843 Hs.143302 ESTs 201 

133574 AA379087 Ha.16178 apoptosb antagonizing trartsafc&in fee 201 

132601 R78838 Hs£4943 fracture callus 1 (i^honotog 201 

117428 N27366 Hs.43933 EST 201 

121108 AA399053 H&97529 EST 201 

130518 X68550 Hs.159161 RnoGDPo^sodafioninh2to(GDI)a!p 201 

110606 H6S049 Hs.19085 ESTs; Weakly ssribrto putative p150 [H 201 



120606 AA282956 7t\SMs\ NCLCGAPjGCBI Homo sapiens cDN 

SW^DRJ^SEP3S38RETlNAL-CAa£RiNPR 



13)070 T47969 Ha 194660 ceraW^pduscinosia: neuronal 3; Juven 201 

130331 Z80783 Hs239884 H2Bhfatone family; it*»tel. 201 

109599 F02602 Hs.6749 ESTs 201 

131749 W78211 HsJ1547 ESTs; H]gNysknQaf tofMDHnibtqufnone 201 

13463 AA376905 Hs.1 11742 ESTs; WeaHy staDarto Iffl ALU SUBFAMJ 201 

114880 AA235693 Hs.65862 ESTs 201 

114745 AA135523 Ks.139064 EST 201 

115637 AA402727 Hs.76925 ESTs; HigNy simBartoR31167J!; parfia 201 

109043 AA159605 HsJ2580 ESTs 201 

128901 Z41411 Hs.107040 ESTs 201 

124427 N36812 Hs.178863 ESTs 2 

100673 HG3033-KT3194 Spfcecsoerffil Protein Sap 62 2 
108436 AA078801 zm94a9^1 StratagenecctonHTa (#937221 2 

123764 AA610019 Hs.1 12654 ESTs 2 

129343 N70791 Hs.180080 ESTs 2 

122794 AA460254 Hs.105043 EST 2 

128688 AA161469 Hs.1 03755 receptor-interacting senne4hreonine ki 2 

115592 AA399543 Hs.48026 ESTs 2 

111693 R220O7 H&23321 EST 2 

113353 T791B6 Hs.14468 ESTs 2 
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Pfcey: Unique E03 probesct identifier number 
ExAccn: Bxcnudar Accession number, Gcutepfc accegsfcm number 



UirigcoelD? Untgcoe uumber 
Uni^nc Title: Unigenc gene title 

Ptey ExAccn UniGID Complete Title Ratio BS/Mat 

106024 AA412059 Hs.1 11742 ESTs; Weakly similar to lilt ALU SUBFAMI 0.17 

110930 N48603 Hs.14947 ESTs 0.18 

105772 AA347973 H&221132 ESTs 02 

133271 Z48633 Hs£940 Haptens mRNA for refrotransposon 02 

107109 AA609943 Hs32793 ESTs 024 

103593 FQ2506 Hs.159591 thyraa homione receptor kitefactor 8 0.24 

123016 AA480103 Hs.111730 ESTs; WeaHy similar to alternatively sp 0.25 

100739 HG3484-HT3678 Proteh Kinase (GtxM59287) 025 

130252 U92014 Hs.1 53527 Human ckme 121711 defective mariner tra 0.26 

105149 AA169253 Hs3958 ESTs 026 

115412 AA283804 Hs.1 93552 ESTs 027 

105952 AA405263 Hs.1 81 400 ESTs 028 

106596 AA456981 Hs35349 ESTs 0.28 

120249 AA167567 Hs.133325 ESTs 028 

111676 R19414 Hs.166459 ESTs 039 

111161 N66767 Hs.124145 ESTs 023 

109364 AA215379 Hs50418 ESTs 029 

132316 U28831 Hinnan piD^lmnuino-reacthFewithanfl- 03 

104030 AA363131 Hs222992 ESTs; Weafdy smiflar to TRANSFORMATION-S 0.3 

109825 F13663 Hs.16798 ESTs 03 

111110 N63165 H&23618 ESTs 031 

135315 W90583 H&9853 ESTs 032 

104792 AA029288 H&29147 ESTs; Highly similar to ZIN C FINGER PROT 033 

123562 AA608893 Hs.190085 ESTs 033 

116079 AA455286 HsJ54982 ESTs; WeaWysimflar to Ml ALU SUBFAMI 033 

110671 H87770 Hs.153800 ESTs 033 

108819 AA130988 Hs.193253 ESTs 034 

115558 AA393806 Hs.1010 regulator of mitotic spmdb assembly 1 034 

104781 AA026617 Hs21610 ESTs; Highly simBarto BAl1-assodatod 034 

111236 N69324 Hs.12526 Homo sapiens done 23903 mRNA sequence 034 

113341 T77866 Hs.189703 ESTs 035 

125371 AHJ84676 Hs.133266 ESTs; Moderately similar to Say7-Gke p 035 

115890 AA435853 Hs.44114 ESTs; Weakly simitar to CGl-73 protein [ 035 

113571 T91116 Hs.15713 ESTs 035 

121683 AA417911 Hs.175863 ESTs 035 

105489 AA256157 Hjl24115 ESTs 035 

116320 AA490866 Hs39429 ESTs 036 

111917 R39882 H&21397 ESTs 036 

127568 T53722 , ya91c06.r3 Stratagem* placenta (#337225) 036 

123541 AA608794 Hs.1 12592 ESTs 036 

123131 AA487207 Hs.193272 ESTs 036 
125069 T86914 Hs.194485 ESTs 036 
114757 AA136725 Hs.1 61 990 ESTs 037 
132778 AA446895 Hs£671 Homo sapiens ctone 23926 mRNA sequence 037 

123132 AA487233 Hs.106711 eukaryofctraretatoJnffiatofector 037 
134029 AA378597 Hs.143601 ESTs; Moderate^ similar to 67A9.D [Done 037 
126956 AI434405 Hs.171957 triple fortctionaJ domain (PTPRF Interact 038 
106869 AA487563 Hs.188813 ESTs 0.38 
107818 AA02D957 Hs.167948 ESTs 038 
129974 K00629 Hs.199300 Human kpni repeat mma (aha done ped-k 038 
129477 D49728 Hs.1 119 nuclear receptor suMamfly 4; groupA;m 038 
119369 T79Q20 Ha245915 ESTs; WeaJ^simOarto Wnase-refatedp 039 
114021 W91995 Hs.16145 ESTs 039 
122024 AA431296 Hs.139433 EST 039 
130014 N50959 Hs.143102 amine oxidase; copper containing 2 (rati 039 
110163 H19326 Ha22073 ESTs; Highly similar to J KAPPA-RECOMB1N 039 
104641 AAD04652 Hs.18564 ESTs 039 
124777 R41933 Hs.140237 ESTs 039 
125382 AA713494 Hs.1 94660 ceroia^ptfusciiosis; neuronal 3; ]uven 0.4 
120406 AA234999 Hs.1 11279 ESTs; WeaWy similar to unnamed protein 0.4 
132734 R23653 Hs.1 64250 ESTs 0.4 
117001 H84719 Hs.40721 EST 0.4 



120905 AA371602 Hs.182930 ESTs; Highly simBarto PHOSPHATTDYUNOS0.4 
125488 AA355158 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C1 91 (fir 0.4 
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121989 AA430C44 Hs.193784 Hon»sapteniRNA;cDNADKFZp586K1922{f 0.4 

127921 AAB06616 H&2QS523 ESTs 0.4 

119830 W7470O Hsl53478 ESTs 0.41 

106292 AA435571 Hs.148560 ESTs a41 

102762 UB23Q3 Hs.12308) HDmosaptourfato^pro^mRNA;paifi a41 

113518 789731 ye11tD&s1 Stmtagane tung (§937210) H 3 

fa) contsins AIu FGps^vB stemBT^oofit 0.41 

100635 HG2724-HT2820 OraxsgeraTfe/Ch^FustonAcflvated 0.41 

113319 T7Q358 Hs.193141 ESTs; WeaHy slmSar to coding sequence a41 

121319 AA4Q2935 Hs.194242 ESTs; Weak* similar to Iffl ALU CLASS B 0.42 

111818 R34382 H&24779 ESTs 0.42 

104883 AAQ52959 Hs.177409 ESTs;H^sirn3aftodJ1119D9J2|Hsa 042 
129258 W95592 Hs^51948 ESTs; Moderately sinter to POLYADENYLAT0.42 
130576 T86475 Hs.16193 Homo sapiens mRNA;cDNADKFZp586B211 (fr 0.43 

106354 AA443271 H&28764 WAA0546 proten a43 
108841 AA132524 Hs.70614 ESTs 0.43 
113922 W80741 Hs.37890 ESTs 0.43 
120997 AA398285 H&97598 EST 0.43 
108158 AA054597 Ha221935 ESTs 0.43 
124518 N58185 Hs.131630 ESTs 0.43 
114477 AA032O13 Hs.144260 EST 0.43 

104290 C16G52 Hs.107205 Homo sapiens mRNA; cDMA DKFZp434L2221 (f 0.43 

126700 AI318412 Hs.1 08253 ac&i binding protein; macraphin (iracrof 0.44 

110887 N38770 Hs.4283 ESTs 0.44 

116141 AA460420 Hs.44949 ESTs 0.44 

110689 H93046 Hs.15571 ESTs 0.44 

115314 AA280563 Ks256501 ESTs a44 

110904 N33453 Hs^7371 Homo sapiens mRNA; cDNADKFZp566J123(fr 0.44 

109482 AA233375 Hs.78085 ESTs 044 

102284 U31449 Hs.11881 transmen*fane4sipen^ 0.44 

118654 N70582 Hs.49892 ESTs 0.44 

115334 AA281244 Hs£5300 ESTs 044 

113149 T51588 ESTs; Moderately similar to ID! ALU SUB 0.44 

113721 T97931 Hs.18190 EST 0.44 

111299 N73808 H&24936 ESTs 0.44 

103778 AA094107 Hs.7187 ESTs; WeaWysWbrtosirn2artogIyo3 0.44 

113204 T57865 Hs.10310 EST 0.44 

100315 D50857 Hs.B2295 ctedlcatarof cytokinesis 1 0.44 

115254 AA279024 Hs.194437 ESTs 0.44 

125500 H46104 Hs.244624 ESTs 0.44 

117387 N26011 Hs53810 ESTs 0.45 

135113 W42450 Hs^06833 ESTs 0.45 

124517 N58204 Hs.199945 ESTs 0.45 

120379 AA227849 Hs*238380 Human endogenous retroviral protease mHN 0.45 

119205 R91954 Hs.153699 ESTs 045 

128266 T70341 Hs.131897 ESTs 0.45 

104106 AA422123 Hs.42457 ESTs 0.45 

115864 AA432080 Hs31200 ESTs 0.45 

113771 W02695 Hs.18714 ESTs 0.45 

126515 A1124649 Hs^52703 Homo sapiens nflNA; cDNA DKFZp586O031 (fr 0.45 

127823 AA524806 Hs.78869 transcription ekmgafion factor A (SII); 0.45 

1166G5 FM405 Hs223654 EST 0.45 

106355 AA443272 Hs£7836 ESTs 045 
132693 AA621429 Hs55075 WAAO410 gene pioduct 0.45 
107388 W01587 Hs.173319 ESTs 045 
110688 H93021 Hs.162937 pepfidyfproJyi tsoir*fraA(cydoph^ 0.46 
116893 H69569 Hs.191316 EST 0.46 
105375 AA236542 H&9512 ESTs; Moderately slmfer to SQ ALU ^iB 046 
115601 AA400277 Hs.46849 ESTs 046 
106895 AA489707 H&29896 ESTs;WeaWyslntotoproSr»TCh pro 0.46 
111770 R27975 Hs.1 87469 ESTs 0.46 
115663 AA405838 Hs.40507 ESTs 0.46 
131404 AA504744 K&26461 ESTs; Weafdy similar to gc^ich sequence 0.46 
108622 AA101828 Hs.169956 ESTs 0.46 
128286 AHE5771 Hs.144090 ESTs 0.46 
105760 AA338960 Hs3170 ESTs 046 
100020 AFFXcontn*BoW 046 
1(5209 AA205072 H&227743 KAA0980 protetn 0.47 
111975 R41724 Hs.149566 ESTs 0.47 
114688 AA121403 Hs.144331 ESTs 047 
116994 H83918 Hs.40528 ESTs 0.47 
118401 N64762 Ks.49053 EST 047 
110997 N52S40 Hs.74316 desmoptaWn DPII) 0.47 
123791 AA620331 K&245351 EST 047 
13858 HQ2266 Hs.167451 ESTs 0.47 
115470 AA2S7122 Hs.48391 ESTs 047 
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130SG8 M402109 Hs. 16533 ESTs 047 

116067 AA454827 Hs.124823 ESTs 047 

125881 AA775807 Hs.150741 2 , ;3*-cycfc nucleotide 3* phosphodkste 0.47 

124028 F04112 Hs.177178 ESTs 047 

108395 AA155574 Hs.172702 ESTs 047 

125102 T95105 Hs.173772 ESTs 047 

110421 H48462 H&36093 ESTs; Weatty slnilar to reverse transcri 047 

105658 AA282914 Hs.10176 ESTs 047 

129046 AA195678 Hs,108258 acfin binding protein; macrcphln (micro/ 047 

113638 T95128 Hs.17529 ESTs 048 
132575 AA045365 Hs.5188 ESTs; Weakly similar to 60S RBOSOMAL PR0.48 

132592 AA123390 Hs£285 ESTs 048 

107619 AA0O4955 Hs.60015 ESTs 048 

118684 N70907 HsJ230619 EST 0.48 

127612 AA917801 Hs. 11 6076 ESTs 0.48 

112319 R55615 Hs.26432 ESTs; Weatyssmfer to finger protefnH 0.48 

113635 T95087 Hs.15543 ESTs 0.48 

119344 T62969 Hs. 133348 ESTs 0.48 

121080 AA398720 Hs.177953 ESTs 0.48 

133686 X83378 Ha211614 cNoitie channel 6 0.48 

130395 R54534 H&87889 hefcasfrmoi 0X9 

127530 AA563806 Hs.145728 ESTs 0.49 

132971 AA033951 H&61700 ESTs 0.49 

127132 AA721156 Hs.1 90440 ESTs 0.49 

129980 T72681 Hs.13969 ESTs 0.49 

105323 AA234112 Hs29075 ESTs 0.49 

114439 AA018937 Hs.128629 ESTs 0.49 

107632 AA007242 H&60179 EST 0.49 
130952 AB002296 Hs21560 Human mRNA for WAAQ298 gene; comptetec049 

127595 AA927308 Hs.130464 ESTs 0.49 

124276 H83465 Hs^21934 ESTs 049 

125935 H30721 Hs30172 ESTs 049 

131275 U45974 Hs25156 Human phosphaWyf&wsftol (4;5) bfephos 0.49 

131196 C20633 Hs24129 ESTs 049 

125505 AJ127843 Hs, 155071 ESTs OS 

113327 T71776 Hs.12097 ESTs 05 

104709 AAD17146 Hs.34579 ESTs; Moderately simBar to IHJ ALU SUB 05 

115772 AA423972 Hs<8154 ESTs 0.5 

118296 N63150 Hs.48723 ESTs 0.5 

131453 C20596 H&26985 K1AA0457 protein OS 

104734 AA019528 Hs32677 ESTs OS 

119358 T70550 Hs.193651 ESTsiWeaWysIn^toaltematlvBlysp OS 
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Table 19: B survivor vs Mets - Up in B survivor 



Pkcy: Unique Eos probed identifier nmnber 
HxAcac Ryf^ytftr A r Tf^t fm mnrfry r^ G ta ha n k »«n?""Ti " rr mtwr 



Unigene Tiilc Umgene gene tMe 



Pkey Ex Accn UniGJD ComptetB TBte Ratio BS/Met 

333601 CH22J=GB*ES213J 55 

3S300 Cai1Jisgil5B66908 457 

333642 CH22JGBES23U 4j64 

333591 CH2*J=GB*ES2Q8J 4.46 

332859 CH2*J=GBflES27J! 439 

304013 AW518573 Ha. 156110 tmrramogbbiiin kappa variable 1 423 

333791 CH22J=GB£S274_10 4.18 

327641 CH.04_hsgP67890 4.03 

321172 H49160 Hs.133472 ESTs 3.9 

334125 CH2?J=GBtE&334 4 358 

333646 CH22_FGeE&234_2 358 
326554 CR19jBgij5867308 3.84 
333650 CH22/GBCS238J 352 

333647 CH22_FGBJES235_^ 3.79 
333626 CH2£/GBES224J 358 
314671 AW236550 Hs.131914 ESTs 3.68 
310847 AM20523 Ha.1 61282 ESTs 3.67 
333657 CH22_FGBES241_2 3.65 
338522 CH22_BftAC0(SaiaGENSCAN^9W6 3.64 
329464 OiYjBgq6455788 35 
328868 CK07Jjsgq6381930 3.6 
333637 CH22_FGBES229_2 359 
329737 CK14jj2gP65779 35 
317828 AI791749 Hs.128896 ESTs 3.44 
330520 M96995 Hs.6289 grew* feeler receptor-bo^ 3.44 
339271 CH2^JA354l1ZGENSCAN.11-2 3.44 
314927 AI735482 Hs.1 59580 ESTs 3.42 
334782 CH22J=GBES.432_7 3.42 
313138 AW138842 Hs.1 96669 ESTs 3.4 
332650 H51596 Hs5541 ATPase;Ca++ transporting; ubiquitous 338 
338648 CH223WC0055MGENSCAN.460S 338 
325677 CH.14jBg^67017 354 
312639 H50648 Hs213221 ESTs; WeaWy similar to [III ALU SUBFAMJ 3.33 
326545 CH.19J6 0^867307 352 
318364 R44616 Hs.138280 ESTs; Moderately sfcnilar to 01! ALU SUB 35 
308385 AK25428 EST singleton (not to UniGene) with exon 326 
328569 CaO7Jisgp)0448O 326 
328582 CR07Jcgi|6006033 324 
310975 AI492857 Hs.170940 ESTs 324 
336883 CH2£J=GENEa3222 321 
324425 AW236939 Hs.172154 ESTs 32 
337870 CH2^atAC005aXlGBKCAN.4^3 3.19 
308624 AI001043 EST siigteton (not [nUniGene) with exon 3.17 
310)91 Z45264 EST duster (noimUniGene) 3.16 
335247 CH2^FGBCS516J 3.12 
324945 AAD88768 EST duster (rid to UniGene) ai 
319468 R06504 EST cluster (not InUnJGene) 3.09 
301635 AI590720 Hs.1 92662 ESTs; WeaWysnnbr to ZINC FINGER PROT 358 
321215 AW378128 Hs.1 20243 ESTs; WeaHystaferto CGW6 protein [ 3.04 
328507 Ca07Jisgp68473 103 
330266 CH55j>2 936671 B85 352 
326249 CH.17Jisgip867263 101 
325649 CH.14jBg!)6588011 259 



304575 AA496437 EST singleton (not tn UreGsie) w3h exon Z9B 

304559 AA488050 ESTstnglBton(riottnUreGOTe)watiexon 257 

338412 CH2^B!AC00550aGQ4SCAN541-25 256 

303707 AT769997 EST singleton (not tn UniGene) with exon 255 

313027 N34307 H&184003 ESTs; WeaHy similar to Ifll ALU SUi^AMJ 255 

305590 AI00Q246 EST smgfeton (not tn UniGene) wftexon 255 

306183 AA922622 EST stogfetan (not to UniGene) wffii exon 254 

303611 AI735372 Hs^O3820 EST; Mocteratefy similar to TRANSNATIONAL 254 

332454 T63265 Hs.1 11 86 ESTsjWeaJ^sbdartotransformafion^ 254 
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330061 




CH.17j2gip721261 


294 


317671 


AW138139 Ha244598 


ESTs 


233 


338705 




OC^EfAAC005500.GBISCAN.4804 


233 


333737 




CH22J=GBJE&261J 


29 


337756 




CH2^ - PAAC0raQ97.GB4SCAN.1093 


25 


333572 




CH22J=GENES.189J 


289 


335349 




OC^_FGEHES^39^2 


289 


328835 




CR07JsgiJ5868339 


289 


319886 


AA984628 


EST duster (not in UniGene) 


288 


311247 


AI655313 Hs.197692 


ESTs 


287 


303887 


R72672 Hs.193484 


ESTs; Weaty similar to Similarity with 


2.86 


337564 




CH22_CKE1.GENSCAN.1-7 


285 






CH22_FGBES.107J 


284 


314938 


AA515635 


EST cluster (not in UniGene) 


283 


305803 


AA846052 


EST singleton (not in UniGene) w3h exon 


283 


305264 


AA679505 


EST singleton (not In UniGene) with exon 


283 


332646 


AA386264 H&5337 


bodbafe dehydrogenase 2 (NADP+); mlto 


281 


338508 




CH22^fAAC0055C0.GENSCAN391-1 


281 


308097 


AM75411 


EST stogteton (not En UniGene) with exon 


281 


301130 


AW194167 Hs.149418 


ESTs Weakly similar to safrary proline 


28 


325571 




CH.12Jisgi|5552439 


28 


307054 


AI148181 Hs.176835 


EST 


28 


337456 




CH22J=GENES.777-2 


279 


317870 


AI797066 Hs.201995 


ESTs 


279 


303171 


AA065003 Hs£4179 


hypothetical protein 


278 


333717 




CH22J=GBES.253_3 


276 


303778 


AW505368 


EST duster (not to UniGene) wifo exon h 


276 


304918 


AA602697 


EST singleton (not to UniGene) with exon 


278 


319373 


R00371 


EST cluster (hot In UniGene) 


275 


336072 




CH22_FGENES£85_4 


274 


306023 


AA897764 


EST singleton (not to UniGene) with exon 


274 


336127 




CH2ZJGB4ES.701J5 


274 


337355 




CH22_FGENESJ28-1 


273 


337885 




CH22^tACC05500.GENSCANM3 


273 


308506 


AI686791 Hs.119588 


ribosomal protein L3 


273 


300629 


AA152119 Hs.155101 


ATP synthase; H+ transporting; mJtochond 


273 


333043 




CH22JGENES.70J 


272 


327736 




CR05jisgip867940 


272 


333007 




CH22JH3ENES.6Q_4 


272 


321966 


AL122111 


EST cluster (not to UniGene) 


272 


323179 


AW452576 Hs. 156875 


ESTs 


272 


332459 


AA609625 Hs.112933 


Homo sapfens Tax hteracSon protein 40 


271 


326224 




Cai7Jjsgfj5867230 


271 


329114 




CHXhsgP68650 


27 


333577 




CH22J=GBiES,196_2 


269 


300413 


AW09Q347 HsJ243443 


ESTs 


267 


304055 


R07994 


EST stop^eton (not In UniGene) with exon 


267 


301013 


AI935304 Hs.125262 


.DKFZP586G1624 protein 


267 


337848 




CH2a^tAC005500.^(SCAN^3-1 


266 


327946 




CR06Jisgij58682D6 


268 


306300 


AA937573 


EST stogteton (not In UniGene) with exon 


266 


331071 


R01646 H&200538 


ESTs 


265 


304841 


AA587541 


EST stogieton (not In UniGene) with exon 


265 


301321 


AI860987 Hs.189097 


ESTs 


265 


311280 


AI767957 Hs.197737 


ESTs; Weakly simfiar to Y38A8.1 gene pro 


265 


338843 




CH22JU246D7.GENSGAN.8-1 


264 


335720 




CH2*J=GBJESS99J3 


264 


333670 


• 


CH22_FGENES^45_4 


264 


313588 


A1803591 Hs^09667 


ESTs 


264 


335750 




CHZLFGBC&6Q2L4 


263 


333240 




CH23J=GB€S.111jt 


263 


332721 


R70212 Hs.79630 


CD79A antigen (immunogtatman-assodated 2.62 


338747 




CH22LENtACC05500.GENSCAN^11-1 


262 


303582 


AA377444 


EST duster (not in UniGene) wffli exon h 


262 


336898 




CH22JGENESJ30-1 


262 


325835 




CH.16J»oJB552452 


262 


301660 


F13112 


EST cluster (not to UniGene) wffli exon h 


261 


335968 




CH22J=GENE&652J 


261 


336705 




CH22J=GBJES.63-2 


26 


308815 


AW292760 


EST stogteton (not In UniGene) with exon 


26 


339220 




CH22JF113D11.GENSCAN.6-15 


26 


308582 


AI709056 


EST stogieton (not to UniGene) with exon 


26 


334260 




CH22J : GENE&387Jft 


26 


309963 


AW449073 


EST stogieton (not to UniGene) with exon 


26 


300178 
335690 


A1282665 Hs.166969 


ESTs 


259 




CH22_FGENES596_5 


259 
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3C8127 AJ4921&7 


ESTsfcgteton (rothUnSSene) wftexon 


1 CO 


337835 


CH22_BfcAC00550Q£BJSCAN^2-4 


O CD 

LJSo 


333251 


CH2^PGBES.11^ 


* CO 

158 


330319 


Ca«Lp2gi^32415 


4 CO 

Zoo 


314490 AI758114 


Hs.197032 ESTs 


Laf 


305934 AA878S15 Hs75442 albumin - 


9 C7 
Lai 


329655 


CU14_p2g56272ia 


Lai 


328558 
336094 


Crl07jBgjp&8489 


Lai 


OC2J=GB<ES.691_3 


257 


307899 AB8QZ70 


ESTsingjdm(ra34hUniGene)wfih€Mon 


151 


339312 


CH22_BA354111GB*SCAN.22-1D 


157 


335442 


CH22JGB4ESAZ7J 


257 


317894 R60848 


EST duster (not In UnJSene) 


156 


330435 HG268&HT27B5 Mucin 5b, Tiacheabroiffiha (GkX74955) 


156 


327X4 


CHj01Jisg#867494 


2.56 


308859 AI830787 


EST singleton (not In UnSGene) wffii exon 


2.55 


302224 AI351549 


Hs.161166 WAA1094pmteJn 


255 


304324 AA137045 


ESTsing!8iOT(notInUna3ene)waexDn 


254 


338090 


CH22_BftAC0teaXl.GBISCAN.17M 


253 


334797 


CH22J=GeJES.434_5 


152 


303535 ALM343Q 


EST cluster (not in UnBene) wSi exon h 


252 


339037 


CH22J3A58Hl8uGeiSCAk2&5 


252 


327846 


Oi05Jisgi)6531962 


252 


325271 


CH.11Jtsgi}5866901 . 


252 


312385 R42885 


Hs^15555 ESTs 


251 


302816 AI733918 


Hs204112 ESTsiWeaWysfrnfertoatemafivehsp 


251 


316841 AVW49871 Hs.124591 ESTs 


25 


300184 AI285912 


Hs^4515 ESTS 


25 


333762 


CH2ZJGBIES2J0J. 


25 


317028 AA962623 


Ks.189144 ESTs; Weaty similar to RENAL SODIUMOEP 




CH.17jsgq5867264 


2.49 


326005 


CH.16_hsgIj58S7112 


2.49 


301971 AJ003125 


Hs. 120330 a dtsinfegrin-B© and metaloprotease ( 


2.48 


326539 


CH.19_hsgl)5867307 


2.48 


338896 


CH2^J3J32J10.(^SCAMM 


2.48 


306773 AI04O750 


EST shgteton (not in UniGsnB) wfti exon 


2.47 


336279 


CH22J-GENES.763J 


2.47 


321017 AL050345 


I&227637 hypothofica] protein 


2.47 


306090 AA908609 


EST singleton (not in UniGene) with exon 


2.47 


333216 


CH22J r GENES.104J 


2.46 


338593 


CH22 - BAAC005a)0.GB<SCAN.435-2 


2.46 


333587 


OE2J=GENESJa)5J 


146 


300396 AW295466 H&232051 ESTs 


2.45 


304693 AA554263 


EST singleton (not (n UniGene) wfth exon 


2.45 


338934 


CH22JXJ32H0.GENSCAN.18-2 


145 


325751 


CH.14J1S g^82474 


2.45 


334137 


CH2^J=GBDE&337J 


2.45 


333581 


CH22_R3BES200J 


2.45 


302083 AI422807 


Hs.134012 C1q-reiated factor 


2.44 


307318 A12C8577 


EST shgteton (not in UniGene) with exon 


2.44 


302181 AW374284 Hs. 157732 Homo sapiens chromosome 19; cosn«JR26 


89 


337425 


CH22JGBIES.761-1 


144 


336227 


CH22JH5BC&730J? 


144 


314657 AKH5953 


Hs. 125265 ESTs 


144 


338529 


CK2^E^OXfi500.GB4SCA(U98-10 


144 


333680 


CH2?_FSENES247J7 


143 


324834 AJ003258 


Hi250891 ESTs 


143 


305093 AA642917 


ESTshgleton (no(lnUnjGene)wfihexm 


143 


335787 


CH23J3BK&611J 


143 


311704 AI655206 


Hs.121512 ESTs; Moderately similar to kiiesin fte 


143 


329382 


CHJOtsgI|5868868 


142 


334785 


CH22JG9Ca43^10 


142 


330130 


CK21_p2gP)02196 


142 


327206 


CH.01_hsgq5867447 


141 


319235 F11330 


Hs. 177633 ESTs 


141 


334691 


CH22jq(5BCa420J 


14 


327610 


CHJ04jBg^867K8 


14 


327646 


Crl04Jsg^867894 


14 


337093 


CH22_FGEf€S.465-18 


14 


335081 


CH22.FGENES.488 4 


14 


333576 


CH22_FGBES.193_J> 


14 


337604 


CH2ZC20H12.GB4SCAN.1&-5 


14 


329879 


Cai5j)2gil6466518 


14 


328444 


CRO7Jtsgip888420 


139 


335700 


CH22J=GB*ES59fcJ 


139 



331® Z41009 K&21446 ESTs; Weakly sfrnDar to HYPOTHETICAL PRO 139 
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327927 




CK06Jogp868173 


9_39 


334354 




CH22J : GENES377J 




308517 AI689279 




EST singteton (not In UniGene) wflh exon 


ZdSJ 


303869 AW499648 HsJ233750 crate V 


9 40 
ZOif 


333848 




CH22JGB*E&237,2 


9 4fl 
ZOO 


318318 A!653893 


Hs.174463 ESTs;WeaWy5&nfartDalpha3bsubuna 


9 4fl 

Zoo 


338336 




CH22_pytAC0K500.GB^SCAN31(W 


9 on 

ZOO 


304125 H40976 




EST stagtetai (not In U nlGene) wift exon 


9 4fl 
ZOO 


304983 AA617786 




EST singleton (not In UniGene) wflh exon 


9 4ft 

Zoo 


334935 




CHZLFGENE&464J 


9 40 

Zoo 


314326 AW170057 Hs.133179 ESTs 


9 4ft 
ZOO 


33)408 D49490 


Hs.76901 


for protein disulfide feonietese-fBlated 


9 4fl 

Zoo 


307648 AI3G2236 




EST singleton (not in UniGene) with exon 


9 4ft 

Zoo 


338911 




<mjW32J10.GENSCAN.11-3 


9 Oft 
ZOO 


319952 T79532 


Hs225725 ESTs; Moderate^ sknflar to CGM01 prot 


zo7 


336878 




CH22_FGBJES.31&-5 


zo/ 


338140 




CH2^EFAAC005500.GENSCAN^03-6 


1 4T 

Zo7 


300564 AB83878 


H3.225588 ESTs 


Zo7 


304635 AA523976 




EST sfogteton (not in UniGene) wfth exon 


Z37 


334091 




CH22J=GENES327_47 


237 


338328 




CH2^J=GBiE&812J 


2.37 


325310 




CKHJis 0^5866864 


Z37 


338043 




CH2Z_BAAC005500.GENSCAN.153-2 


Z07 


307090 AI161024 




EST singleton {not in UniGene) with exon 


Zo7 


335768 




CH22JFGENESJ8U7_2. 


237 


334969 




CH22J=GB*ES466J2 


n try 


333540 




CH2Z_FGENES.230 JL 


Zoo 


330002 




CR16_p2gi]6623963 


2J36 


338829 




CHZZJW246D7.GENSCANS-12 


236 


323808 AW250114 




EST duster (not in UniGene) 


Zoo 


327755 




Ca05Jisgil5867955 


235 


306426 AA975039 




EST stngjeton (not fri UniGene) with exon 


235 


336481 




CH22_FGBJEa830J 


235 


335163 




C822J=GBJE&50£J 


235 


322012 AL137357 




EST duster (not in UniGene) 


235 


337345 




CH22_JGB*E&723-1 


235 


334625 




CH2ajGaffi&414_3 


235 


320957 AIB7B933 




EST duster (not in UniGene) 


235 


334915 




CH22J=GENE&457_4 


235 


336295 




CH22J^GENES.787_1 


235 


321556 N46402 


Hs.14570 


ESTs 


235 


338491 




CH2?_EMACO(6500.GENSCAN385-2 


235 


335517 




CHz?J=GBJES.571_.34 


234 


330639 X90872 


Hs.75354 


SULTICsutfotransferasa 


*» 4JI 

Zo4 


310383 AI263102 


Hs.145596 ESTs 


234 


331526 N49967 


Hs.46624 


ESTs 


234 


3343% 




CH22_R3B*ES381^2 


234 


332993 




CH2?J=GBC&57_2 


234 


327487 




CH05Lhsgi)5867785 


234 


335920 




CH22_FGENES.636J6 


233 


336463 




CH2g_FGBES.829Jg 


233 


319000 Z44318 




EST duster (not h UniGene) 


233 


332992 




CH2?J=GBJE&57J 


233 


332920 




CH22_FGENE&37_6 


233 


337590 




CH22_C20H12.GENSCAN.65 


233 


327059 




ai21_h3gip531965 


233 


334399 




CH22J=GBIES382_5 


233 


300982 AA837754 




EST cluster (notfri UniGene) wffii exon h 


232 


327430 




CR02_hsgIp887754 


ZoZ 


326808 




CK20J»gQ6682504 


o oo 
Zoz 


309324 AWD15373 




EST singleton (not in UniGene) wth exon 


Z3Z 


329779 




CR14_p2gi[6002090 


232 


330492 M25809 


Hs.64173 


ATPase; H+ transporting; lysosomal (vacu 


9 44 

Zoi 


330080 




CH.19_p2gp)15314 


231 


334342 




CH22_FGENES375_^0 


9 44 

Zoi 


336306 




CH22_FGENES.793J5 


9 4.1 
ZOI 


336400 




CH22J=GENES.823J5 


9 44 
ZOI 


323735 AA323714 




EST duster (not In UniGene) 


231 


334496 




CH2*JGENES397_12 


231 


336075 




CH22_FGENE&687J 


231 


335566 




C«3LFGENES580J 


231 


337657 




CH^_BAAC000097.GENSCANJ2-9 


231 


327816 




CRG5jBgp867968 


23 


308465 AI672480 




EST singteton (not in UniGene) with exon 


23 


330112 




CH.19_p2gp)15238 


23 


304465 AM21948 




EST singteton (not in UniGene) with exon 


23 
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328171 
328271 



AI66Q854 



330063 

312281 H11643 



333859 
326253 
3E703 



EST sfngieton (not tn IWGece) with e 

CR0SjBgP68071 

CRC6 Jegpa52415 

Oi07_hs ^8004475 

Cai9j2gIjBie044 

EST cluster (not hUnKSets) 

CHiS_h3g}p86852D 

CH2^J=Ge(HS.290J8 

CH.l7JsgP872S3 

CH.14jBgipG7Q28 

CH22_DJ32imGENSCAN.1^3 



337244 CmjGBESJ&m 

314770 AI732722 Hs.187694 ESTs 

324560 AW5Q2208 EST duster (not h UnJGene) 

310603 AW376860 Hs.156338 ESTs 



337363 
308015 AI440174 
309208 A1961962 
337455 
327605 

301611 W22172 
317222 A1206964 
338278 
337291 
337913 

306406 AA971973 
332947 

321763 W01148 
304424 AA2S3494 
303782 T64737 
326943 

324977 R14439 
325480 
327743 
333221 
3364% 

321583 H84421 
334191 
327089 

310001 F18939 
304056 R08577 



CH22JGBJES733-2 



23 

23 

2-3 

23 

229 

229 

229 

229 

229 

229 

229 

229 

229 

229 

229 

229 

229 



Ks228907 EST; WeaWy skrt3ar to GUANINE NUCLEOID 



ESTsngteton (not in IM^ne) with exon 228 

CH2?J=GBlEa777-1 228 

CHiJ3_hsgP04463 228 

Hs£9Q38 ESTs 228 

Hs.130051 ESTs 228 

Oi22_atAC005500.GB4K^N^3 228 

(mjGBIES^7W 227 

CH2LEMACC05500.GB^SCAM59-10 227 

EST singleton (not ki UraGene) with exon 227 

CH22J=GBES.47_1D 227 

EST duster (not tn UrdGene) 227 

EST stigieton (not In ttniGene) wfth exon 227 

EST duster (not hUniGene) with axon h 227 

CH21jBgi)6Q84446 227 

Hs209194 ESTs 227 

CR12Jragij5866957 227 

Ca05_hsgjj5867944 227 

CH22JGB*E&105J 226 

CH2^J=GOES^33_3 226 

EST cluster (not tn UniGene) 226 

CH22_PGENEa352_6 225 

CH21Jisgij6531965 226 

Hs.153827 ESTs 226 

EST singleton (not in UniGene) with exon 225 

324700 AW504745 Bs.103913 ESTs; Moderately similar to ID! ALU SUB 225 

330637 X86371 Hs£5659 tethaJgiarrf larvae (Droso^ 225 

307642 A1302103 EST singfeion (not in UniGene) with exon 225 

336985 CH22_PGBIES.402-6 225 

334425 CH22J=GBES384_13 225 

321216 AI078042 Hs.126691 ESTs 225 

315785 AW205946 Hs.150319 ESTs 225 

305809 AA853998 Hs.124580 EST 225 

331334 AA284858 rfc*89134 ESTs 225 

317131 AB91125 Hs.189109 ESTs 225 

CH2SJGENE&358J 224 

CK08_p2giJ567Q267 224 

CH21_hsgtj645o782 224 

CH22JK3ENES272_5 224 

Ks2Q2520 ESTs 224 

CH22_^AAC005500.GENSCAN.413-2 224 

Hs251354 ESTs 224 

CH2?_B«AC0(KJ097.aNSCAN.77-5 224 

Hs.147178 EST 223 

CH22_FGB4ES.411_16 223 

CH23JGBJE&326-10 223 

CH2ZJGBtESJ3S7JQ 223 

CftOIJs 91)5867493 223 

CH22_EJAAC005500.GENSCAN.41^ 223 

Hs.146817 EST 223 

ftsapiens dopamine D1A receptor gene, co 222 

CH.16Jsgij5867138 222 

CH22JXl32r10.GB^SCAN21^ 222 

317465 AW197361 Hs.131360 ESTs 222 

332458 M33493 Ks.184504 tryptasec alpha 222 

333195 CH22LFGBE&S8J7 222 

304337 AA587139 EST singleton (not in UnEGene) with exon 222 

307602 AJ288843 Hs231239 EST 222 

337078 CH2*_FGB<E&457-1 222 



228 



334216 
330330 
326923 
333774 

324311 AA443061 
33S551 

306716 AIQ24916 



300079 A1192520 
334617 



334495 
327301 
337856 

307072 AI150424 
330515 M85247 
32943 
338347 
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335862 CH22_FGENESiQ9L7 222 

301979 L28168 Hs.121495 potassium voflag&flated channel; Islwel 222 

335668 CH23JGB4E&590J9 222 

3050© AAS39818 EST singleton (not In UniGene) wffii exon 221 

329034 CaX_hs 935868561 221 

318403 AI131241 Hs.143234 ESTs 221 

320)58 CH.06Jisgii5902482 221 

335513 CH22J r GBCa571J8 221 

330803 AAQ04699 Hs.150580 putative translation inftafon bdor 221 

331427 H54764 Hs237339 EST 221 

338973 CH2?JU32I10.GB^SCAN27^ 22 

336723 CHZLFGENE&8W 22 

327290 CR01Jisgit5867483 22 

337240 CH22J r GENES.644-1 22 

306201 AA926818 EST singleton (not In UniGene) with exon 22 

303659 AA868464 Hs.126263 ESTs; Highly similar to RBRJLLARIN \Hjs 22 

334517 CH22J=GENES.399_7 22 

334169 CH22_F6ENEa3SL4 22 

335199 CH22_FGENES^08_S 22 

333705 CH2ZJ r GETffiS^)J9 22 

305794 AA845324 EST singleton (not tn UniGene) with exon 22 

303273 AA316069 EST duster (not In UniGene) witti exon h 22 

313384 W85694 Hs.118335 ESTs 22 

329158 CHJOagi|5868687 Z2 

337551 CH23J=GErE&847-8 22 

3287S2 CH.07Jsgip868309 22 

303737 AW502711 EST cluster (not in UniGene) wflh exon h 219 

324529 AW5Q2466 EST duster (not in UniGene) 219 

323103 Z45529 Hs.92030 ESTs 219 

333773 CH2£JGBES27£jl 219 

337906 CH22_EM^OQ5500.GENSCAN.56-19 2.19 

327129 CH21jHgil6531976 219 

305710 AA826544 EST singleton (not in UniGene) with exon 2.19 

335595 CH2^FGEN£S581_34 219 

323646 AA310925 Hs.154412 ESTs 219 

328368 CH.07_hsgi|5868388 219 

325802 CR14Jisg]|6552451 219 

337167 CH22J=GENES.562-27 219 

2050® AA635756 EST stogteton (not in UniGene) with exon 218 

321445 AW245524 rte.121590 ESTs; WeaWy sfrnSarto ZWC FINGER PROT 218 

332790 CH2^_PGENES^4 218 
336750 CH2?J=GENES.12&4 218 
310999 A1520706 Hs.171012 ESTs 218 
329798 CH.14_p2giJ6523160 218 
327012 CH21JisgJp867664 218 
304599 AA506638 EST singleton (not In UniGene) wifli exon 218 
335351 CH22J=GENE&539_4 218 
310661 AB54717 Hs223908 ESTs 218 

332791 CHZLFGENE&3J 217 
333022 CH2?J=GENES£5J 217 
310502 AJ458973 Hs.170422 ESTs 217 
324963 AA&53440 EST duster (not in UniGene) 217 
325275 CH.11_hsgi|5866302 217 
328338 CH.07J8 0^868377 217 
333063 CH2LFGBCS.75_6 217 
308895 AI858423 EST singleton (not in UniGene) wffli exon 217 
338685 CH2a_0AACQO55T)O.GB*SCAN.472-4 216 
32655 Cai4Jragtp867007 216 
332420 H49570 Hs.108074 ESTs; WeaWysimfortoCEREBELLIN 1 PRE 216 
337216 CH22JGENES.613-10 216 
335660 CH2JLFGENES^90_11 216 
337145 CH22J=GENES£42-2 216 
335753 CH2ZJ=GENES.604J* 216 
301766 R02224 EST duster (not in UniGene) wifo exon h 216 

303442 A1953998 Hs. 152510 ESTs; WeaWy similar to L-SERINE DEHYDRA 216 

311009 A1949701 Hs210589 ESTs 216 

307093 A1167606 EST singleton (not in UniGene) with exon 216 

300262 AI874402 Hs.170810 ESTs 2.16 

337989 CH22JMAC005500.GB1SCAN.112-7 216 

326263 Cai7_hsgi)5667264 216 

319402 W21298 EST duster (not in UniGene) 216 

321010 Y17456 Hs227150 Homo sapiens LSFR2 gene; last exon 216 

301706 AI929150 Hs241496 ESTs 216 

307412 A1241753 Hs241507 rfcoscmal protein S6 216 

335662 CH22_pGBca590J3 2.15 

332480 AA092932 Hs.12570 tubufln-specffic chaperone d 215 
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329273 CKXjBgP68762 2.15 

339383 CH22JA23ZE17£BfSCWO30 2.15 

3327% CH2*J5BIE&5J 2.15 

335227 CH22JGBESJ513J3 215 

32693 CR21Jisgf]645S782 215 

3S403 AA4241S9 Ha105529 ESTs;H^d/dn&artDCG^ protein [ Z15 

317786 AI859605 Hs.155586 ESTs 215 

326582 CK19JtsgP67318 215 

336494 CH23JGBC&832J1 215 

329655 CK14jJ2gil5448516 215 

307581 AE284415 EST slngJeton (rail hUniGene)wffliexon 215 

335570 CH2?JGBE&591_2 214 

332452 AA040369 Hs.11170 SYT Intaacfing protein 214 

309387 AW079943 Hs. 1561 10 bniramogbbufln kappa variable 1M 214 

308427 AI552677 Hs.195055 EST 214 

322027 NMJD04551 ESTctoer(notbiUniSene) 214 

301693 Z45023 EST duster (rxrfbiUniGeneJwffljexonh 214 

334308 CH23_FGBiE&373J1 214 

301131 AW134518 Hs.131807 ESTs 213 

338495 CH22_0AACOO55OOX5B4SCANJ87-1 213 

329600 CK10Lp29ip962481 213 

307980 AJ431698 EST angteton (rad In UniGene) wffli exon 213 

337260 CH2*JG9<E&652-15 213 

304655 AA527887 ESTangleto(notlnUniGen8)w9iexQn 213 

303141 AF195951 EST duster (not in UniGene) wffli exon h 213 

327957 Ca06jB955ffi8210 213 

334317 CH2Z_FGB*ES374J 213 

302870 AF011407 EST duster (not to UniGene) wifli exon h 213 

333808 CH22J=G9iE&278J2 213 

329947 Cai5_p2giJ5540101 213 

309602 AW162523 EST singfeton (not (nUnH3ene)*ah exon 213 

32279) AI700273 Hs, 122 162 ESTs; WeaMysinto to WAA0557 protein 213 

337706 CH2^_0AACOOOO97.6BJ«m87-1 1 213 

306894 AI092731 EST singfeton (not in UniGene) wffii exon 213 

325530 CR12JegIjS525289 212 

321087 AL11Q227 H&241533 Homo sapiens mRNA; cDNA DKFZp434J194 (fir 

309853 AW298169 H&57553 tousled*© khase 2 212 

326822 CU20jBgi)61 17831 212 

328776 CR07JisgP68309 212 

335112 CH22J=GENES.494_20 212 

334564 CH22J=GB4ES.4Q5_4 212 

333455 CH22J=^NES.157 4 212 

317395 R55044 Hs.124130 ESTs 212 

334221 CH2*_FGB4ES.360J 212 

331374 AA442134 Hs.70573 ESTs; WeaUysbnSarto HINT PROTEIN pi 212 

304473 AA428343 Hs.140 immunogtobuSn gamma 3 (Gm marker) 212 

328907 CH08Jisgp68493 212 

319448 R05539 Ks.108738 ESTs 212 

333676 CH22J=GB*E&247J3 212 

324767 AA630931 Hs34348 Homo sapiens mRNA; cDNA DKFZp434P0235 (f 

318585 Z43405 EST duster (not in UniGene) 212 

331732 AA251192 Hs.177708 ESTs 212 

329553 Oi10_p2 ^3962492 212 

336910 CH22JGB4E&3436 212 

326359 CH21jBg$469836 212 

305417 AA725228 EST singleton (not In UniGene) wm axon 211 

301573 AI150328 Hs2264Q2 ESTs; WeaHy similar to rratochondrial d 211 

326935 CH21JisgP»4446 211 

335176 CK2*JGB€S.504_6 211 

337210 CH22J=GB€&6035 211 

311284 AW027025 Hs239262 ESTs 211 

330240 Ca05_p2glj6671858 211 

327463 CH.<gJ)sgq6004455 211 

332938 CH22JGBIES.4U 211 

33Z785 CH22JGB£S.1J 211 

301035 A1358105 Hs.123164 ESTs 21 

305712 AA826701 ESTs^tetm(notinUn83ene)wffliexon 21 

318651 AW003150 Hs^40165 ESTs 21 

302753 M74299 EST duster (not fei UniGene) wffli axon h 21 

334635 CH22JOBfESAU^ 21 

313447 AA456745 EST duster (not m UniGene) 21 

301204 AW008544 Hs^39994 ESTs 21 

333950 CH2*/SBiE&303jS 21 

325947 CH.16_hsgP&7138 21 

337683 CH2^BVWC000097.Ga4SCAN76-1 21 

328962 CK(B J» 0^6456775 21 
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336655 CH2^F6ENES3« 2.1 

336596 CH2^PGBCS.163^2 2.1 

330486 M13755 HsX33 IntBfferofwfimuIated protein; 15 ©a 2.1 

314356 AA531607 Hs. 125143 ESTs 2X9 

314976 AA524725 Hs.162108 ESTs 2X9 

336650 CH23J=GENES.29* 2X9 

339025 CH22J5A59H18.GENSCAR22-6 2X9 

302395 AW297357 Hs.114606 ESTs 2X9 

323280 AJ910263 EST duster (not In UnXSene) 2X9 

338657 CH22JDJ32H0.GBISCAN.1-1 2X9 

335374 CHZLFGRIES.543J2 2.09 

30876S AI808510 EST sJngtetar (not In LMGene) with exm 2X9 

331027 N48584 HsX168 WAA0703 gene product 2X9 

337853 CH2^_ENtAC0055CO.GENSCAN37-1 2X9 

302493 NMJI02991 EST duster (not In UniGene) with exon h 2X9 

312607 A1337440 Hs. 169375 ESTs 2X9 

3143© Z44049 Hs. 184352 ESTs: WeaHystmfer to cDNA EST EKBLD3 2X9 

311695 AI142078 Hs.135562 ESTs 2X9 

333280 CH2^_PGENES.125^2 2X9 

333518 CH2ZJGBJES.173._3 Z09 

337199 CH22J=GENESS83-11 Z09 

337819 CH2^BtAC005500.GENSCAN.13^ Z08 

300546 AA214450 Hs.250913 ESTs 2.08 

322577 AA354452 Hs.59075 ESTs; Weakty similar to WD40 protein Cte Z08 

330)28 CH22J=GENESX72_1 Z08 

300238 AI394673 Hs.254030 ESTs Z08 

307429 A1243573 EST singteton (not In UniGene) with exon Z08 

326444 CH.19 _hsgl)5867385 2X8 

310641 AJ345597 Hs^54727 ESTs Z08 

337633 CH22_C20H1ZGENSCAfU2-1 Z08 

336008 CHZLFGENES.663.6 Z08 

339030 CH22J)A59HiaGENSCANJZ4-1 2X8 

333952 CH2?_FGENES.303J3 Z08 

329149 CHJOBgip8©685 Z08 

335192 CHZLFGBE&507JT Z08 

308225 A1557713 Hs.177592 ribosomal protein; large; PI Z08 

330519 M94172 Hs.69949 caWum channel; vottage^tependentLty Z08 

331809 AA402482 HsX7312 ESTs Z07 

324837 AJ003669 Hs.246171 ESTs Z07 

332608 D00749 Hs.36972 CD7 antigen (p41) Z07 

327291 CH.01J»gi)5867483 Z07 

315936 AW069807 H&247094 ESTs; Moderately amBar to 111! ALU SUB Z07 

317917 AI143593 Hs.129419 ESTs Z07 

328874 CHX7JtsgI|5868254 2X7 

338654 CH2^BAACC055C0.GB*SCAN.46O65 Z07 

320828 AJ012590 Hs.194728 hexos8-6-phosphato dehydrogenase (glucos Z07 

337896 CH2Z.B4AC005500.GENSCAN*3 Z07 

335310 CH22J=GBte&532J3 Z07 

300078 AW074835 Hs.145223 ESTs Z07 

303588 A1X46182 EST duster (not hUroGeneJwffli exon h 2X7 

328848 Ca07J»gQ6381921 Z07 

318723 C18060 EST duster (not in UniGene) 2X7 

335352 CH22_FGB<ES£39J5 Z07 

339316 CH22_BA354JtZGENSCAR22-15 Z06 

335873 CH22J=GENESJ>31_1 2X6 

335261 CH22_FGB1ES.52DJ> Z06 

322032 AL079807 EST duster (not In UniGene) Z06 

308771 AJ8093OI EST sngteton (not in UniGene) with exon Z06 

310024 A1252661 Hs.145224 ESTs Z06 

320555 R36212 Hs^35534 ESTs Z06 

319314 T74062 EST duster (not in UniGene) Z06 

334642 CH2?_FGENES.417J9 Z08 

335767 CH22JGENESX07J 2X6 

338159 CH22_FGENES.707_3 Z06 

336358 CH22_FGENESX18J 2X6 

334687 CHZLFGEICS.419 12 2X8 

339389 CH2LBA232E17.GBiSCAN.4-7 Z08 

335898 CH22_FG£NE&635_6 2X8 

328847 CHX7Jisgij6381920 Z08 

313431 W91884 EST duster (not hUnEGene) 2X8 

313270 AJ374993 Hs. 159611 ESTs Z06 

339211 CH2*JT113D11.GENSCAIt*6 Z06 

333860 CH2^J=GENES^90J9 Z06 

308X52 A1868157 H&224226 EST Z08 

305471 AA743947 EST singleton (not in IMGene) with exon 2X6 

300619 AA991438 Hs.233293 ESTs Z06 
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302982 AK93349 Ha228381 EST 206 

332446 AA112799 K&2387S ESTs;WeaMysiiitotountaKwn[asapfe 106 

33*972 CH22JGB4E&468J 205 

330196 CH05_p2giJB1 65140 205 

304754 AA579795 ESTsins^Qn(not!nUn£&i8)w»iexon 2j05 

309726 AW248521 Hs.195188 glpsalite^ 205 

333339 CH22_FGBffia301J5 Z05 

304836 AA507OO8 EST singleton (not in UniGene) with exon 2jQ5 

302087 AA324163 ESTchBfar(nothUriSaK)wfftewnh 205 

308424 AJ650714 EST singleton (not in UnHSene) wffi> axon 205 

304347 AA176914 EST singleton (not k\ UniGene) wfth exon 205 

333141 CH22J=GENESJJ5 1 £05 

310573 AW292180 Hs.156142 ESTs 205 

337565 CHZLC65E1.GENSCAN.1-11 2j05 

304295 AA084Q82 EST sfngtetoi (not In UnJGene) w5h exon 205 

326624 CH2DJisg!j5867553 2.05 

328443 Oi19Jisgp867385 204 

339012 OJ22J3A59H18.GSJSCAN.19-2 204 

337384 CH22J5BJES.745-1 204 

3323^ T79623 Hs.111787 ESTs 204 

303706 AW501525 EST duster (not lnUraGene)wffi» exon h 2.04 

336046 CH22JGENESJ57SJi 204 

301770 R05887 EST duster (not h IWGene) with exon h 204 

326726 Ca20Jsgip867597 204 

330485 M11186 Hs.113216 oxytoc^ prepro- (neiiiDphysfn 0 204 

332956 CH22JGENES.48 13 204 

300021 M97935 AFFXcontrot STAT1 204 

306872 AHB6920 ESTstno^(notlnUnX^)wfihexon 203 

302744 L03151 ESTduster(notfnUriGene) wift axonh 203 

338507 CH22^E?AAC00550aGENSCAN390-1 1 203 

334020 CH22J=GBC&317_1 203 

333870 CH22J=GB4E&291_3 203 

330552 U40223 H&248157 pyrihtiinaglc receptor P2Y;Gi»otMc 203 

335486 CHZLFGB4ES570 18 203 

339374 CH223A232E17.GB4SCAM2-5 203 

328364 CK07_nsgii5868392 203 

33460) CHZLFGENES.420J 203 

310318 AI733942 Hs.145338 ESTs 203 

325893 CKl6Jlsgip867088 203 

331373 AA435513 Ks.178170 ESTs; WeaWjr sfcnflar to DUAL SPECIFICITY 203 

329784 Cai4ji2g^912597 203 

335087 CH22J=GENES.488J1 203 

310582 AI336563 H&254585 ESTs 203 

332611 R06751 Hs.1600 cteperonin containing TCP1; subunlt 5 (e 203 

339258 CH2£_BA354H2GENSCAN.8-3 203 

336851 CH22LFGBIE&274-1 203 

305596 AA780664 Hs*734 ESTs: MateateTf statotollll ALUCLA 203 

330364 CHXp2g|3126882 203 

302940 AL137619 EST duster (not fn Un«^) with exon h 203 

317349 AA923657 Hs.126359 ESTs; WeaHy simferto till ALU SUBFAM1 203 

309869 AW300314 EST singleton (not In UnfGene) witn exon 203 

333422 CH2£JGENE&147J! 203 

325233 CH10JegJj6381943 203 

330586 U77968 H&79564 neuronal PAS domain protein 1 203 

336725 CH22_FGBJES*8-1 202 

334157 CH22J=GB4ES340.J 202 

303357 AW006352 Hs.159643 ESTs; WeaMy sfmiar to MLO [H^aplens] 202 

328533 CR07Jagip868482 202 

309210 AI982817 EST singleton (not toUniGeneJwHh exon 202 

327412 CK02J»g^867750 202 

333172 CH22J=GBCS^4J 202 

334869 CH22JGENES.447J 2L02 

301047 AA971465 Hs.116136 ESTs 202 

3S394 CRXJis $6478817 202 

301736 F12128 EST duster {not In UriGene) wifo exon h 202 

335581 CH22J=GENE&581_30 202 

338234 (W2£tMClXB8toGBt$CANm7 202 

334433 CH22JGENES.385.8 2.02 

334904 CH2?JGBIES.452_18 202 

318443 AI939323 Hs.157714 ESTs; Weakly stmBarto HBSR ACETYLCHOU 202 

300151 AI243445 Hs.189654 ESTs 201 

310348 AW78563 Hs.145519 ESTs 201 

310898 AM39868 Hs.165742 ESTs 201 

332860 CH2*J=GENE&27J 201 

301699 AB79117 EST duster (not In UniGene)wffl» exon h 201 

332554 W96450 H&23111 phenyfeianmatfNA synftetase-llte 201 
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327994 CK0SJistf586821B 2.01 

315613 AW137420 Hs.1 92311 ESTs 2L01 

335358 QU2JGB1ESMJ 2D1 

334028 Ctt£JGB<ES318_7 201 

335277 CH22JGBiE&523_3 2.01 

308657 AI749855 H&236497 EST; WeaWy simflar to GLANDULAR KALLIKR 2.01 

305913 AA876109 EST singleton (not in UniGene) with exm Z01 

323681 AW247730 Hs.102548 glucocorticoid receptor DNA binding fact 2XM 

333533 CH23J=GBES.175J0 2.01 

328753 CR07Jisgipe68298 2D1 

302337 L01694 Hi211523 guanine nucteofida bhtfing protein (G pr 2D1 

304643 AA526588 EST singleton (not tn UniGene) wfth exon Z01 

333085 CH22LFGENES.75 _8 2.01 

316192 AA904441 H&221286 ESTs 2 

302533 L36149 Hs248116 chemoWne (C motff) XC receptor 1 2 

312988 AA813689 Hs.123436 ESTs 2 

333612 CH2?^GENES217 B 7 2 

333615 CH2^_FGBiES217J0 2 

316085 AIQ27959 Hs.1 32300 ESTs 2 

337936 CWSLPMAC005500.GBJSCAN,85-7 2 

330972 H18467 Hs.1 18983 ESTs; Weakly similar to diaphanous 1 [H. 2 
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Table 20: B survivor vs Mets - Up in Mets 



Pkry: Uotgne Bna protect ttkatifici mi mW 

ExAccnr HflHirpT Tf A c c ession nunibo^ ^ Tn ilm ^ fl**^^*?* 1 number 



Urigecc Tiile: Untgenc gene tffie 

Pkey ExAecn UltiGJD CompteteTffie Ratio BS/Met 

316625 AA780307 Hs.122156 ESTs 0.28 

316076 AW297895 Hs.116424 ESTs 03 

315943 AA6S9756 H&117335 ESTs 038 

317198 AI810384 Hs128Q25 ESTs 038 

320082 AA487678 Ha.189738 ESTs 039 

313510 AI147291 Hs.154005 ESTs 039 

323683 AB8Q045 Hs225033 ESTs 039 

318558 AW402877 H&S0372 ESTs 04 

310264 AI915771 Hs.148867 ESTs 04 

314S45 AW276866 Hs.192715 ESTs 041 

313403 W86995 Hs.113157 ESTs 042 

321505 H73183 Hs.129885 ESTs 0.43 

312171 AW444619 Hs.138211 ESTs 0.43 

324585 AJ823969 H&13267B ESTs 044 

315595 AA803844 EST deter (not in UniGene) 044 

319818 AA825819 Hs.136952 ESTs 044 

337522 CH22JFGBJE&819-1 0.45 

324714 AA574312 H&245737 ESTs 045 

315060 AA551104 Hs.189048 ESTs 046 
300545 AJ026836 Ks.1 14689 ESTs 047 
304483 AA431441 ESTslngfeDn(RatlnUnCene) wSiexoi 0.47 
313096 AM22367 Hs.163533 ESTs 0.47 
306501 AA987294 EST singleton (not En UniGene) with exon 0.47 
329086 CRXJisgll5868604 047 
320176 AA167566 Hs.133325 ESTs 0.47 
320418 AJ674461 Hs.199638 ESTs 047 
302982 W92391 Hs. 198222 ESTs; WeaWy sfanBarto C2H24ype mc f 0.48 
315609 AW207535 H&224012 ESTs 0.48 
317056 AA904908 H&250643 ESTs 0.48 
314361 AUJ38765 Ha. 161304 ESTs 0.49 
315169 A1371390 Hs.158667 ESTs 0.49 
323743 AA324992 Ha257168 ESTs 0.49 
313903 AW167439 Hs.190651 ESTs 0.49 

315061 AA551196 H&188952 ESTs 049 
300969 AI140799 HsJ6230 rtoosomal protein S10 05 
331950 AA454595 H&99369 ESTs 05 
315076 AJ623817 Hs.168457 ESTs 0.5 
300975 A12B3548 Hs.149668 ESTs 05 
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TABLE 1-20A 



Table 1-20A, shows the accession numbers for those pkeys lacking unigenelD's for Tables 
1 -20. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed Gene clusters were compiled using sequences derived from 
Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
usiijg Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Ptey: Unique Eos probesei idenfffier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Ptey CAT Number Accessslon 

108446 112224J AA085383 AA126091 AA074174 AA075373 AA079120 AA07Q831 AA075978 AA075372 AA128503 
108474 116898J AA1 15179 AA079667 AA115897 AAD79771 - 
100635 10605.34 BE259G39 W29128 AW410299 X72990 BE246492 NM.005243 XB6899 AK09006 AW248151 AL031 186 AA012966 BE273549 
BE311429 BE2531Q2 Y07848 BE538102BE256863 BE261240 BE312156 BE618412BE257322 BE620446AW806629 ' 
AA376777 AA325384 BE&6808 BE251039 BE257878 BE275352 AA357169 AW403562 AA204995 AA093259 W95953 
BE256279 BE336683 BE252465 BE251266 AA380754 BE294942 AA380941 AA380999 BE297164 BE249995 BE294719 
BE295372 AI270673 BE305132 BE563752 BE295357 A1525421 BE263980 AA057505 AA020915 BE266318 BE206948 
AW74020 BE296420 BE297374 BE408545 BE019366 BE407372 BE266180 BE279437 R58233 T19567 BE300738 AW381 179 
AA357571 AW361285 AA436908 AA301019 AA301022 N20202 BE408777 BE548638 BE167415 AA071260 BB088429 
BE280092 W231 17 T19568 R51681 AW402216 W22784 BE1 85607 AM57224 BE544120 AL134874 S72620 AA375079 D51319 
AW818280 BE514686 AW853024 BE5B3744 AA300469T07592 BE622190 BE272834 W21781 BE315450 BE542367 BE393120 
AA988441 H55137 BE562296 BE622502 BE395950 AA329733 AA332348 AI76831 7 AA456866 AI497832 AW878437 AA857042 
U18018 BE621418 AI8187S0 AI949507 BE397693 AI885545 AI858854 A1355147 BE169028 S62138 AW732191 AA856B91 
BE266060 X71427 BE268557 AF09S890 AWD01288 AJ793S34 AI623498 AA071346 BE5476S2 BE261446 A1564543 BB59759 
U35622 BE314249 BE264915 AI638591 AI538385 AW090025 BE384754 AI888689 AW778800 AI925273 AA075797 AW949130 
AV660275 AW438697 AJ587137 AI524121 AA806249 AW628247 AA808241 AI244388 AI761125 AW1 17672 AA91 1782 
AI129250 AA654447 H55291 BE258050 BE206162 W95867 AA857187 AI871378 AI660103 AW1 03827 AI220929 AW149949 
BE465561 ABG2857 AW168841 D82190 AW249B14 AI623432 AI687358 AW951077 R51592 W60458 AKJ92863 AW474693 
D12765 AI91 1646 082208 082187 AW074031 AB58527 AW338497 AA970893 AW072573 AA205364 AJ858866 AA012830 * 
AW148763 A1863056 AA548656 BE250325 AI016994 AI864005 BE046122 AI497746 C75340 R58896 D82141 AW168240 
C19048 AI741090 D29465 AI222365 AA948288 AJ583522 AW572212 AKJ91290 AA582727 AA579897 AA570629 W60883 
AW516989 ALQ38160 AA577334 AI865B72 AA994043 AA922583 AA464778 AA209178 AI829479 AJ370235 BE246529 
AA384177AA456255AB99730W60654AU)35744AA862042R32ra 

AMI 9503 R32755 AW075358 AI432315 AA457024 AA020865 R92132 AA454629 AA746059 AA454643 AA456240 AA826984 
BE163738 A1BQ8470 A1991074 AI8G2560 AA587095 AA558714 AA968521 N87780 AB38246 N71794 AV561738 A13689Q3 
AA362570 A1989445 A16749S2 S75752 BE245204 AA975296 D201 23 AW005704 AA693328 AA582270 AJ918474 AW205707 
AJG96299 AA220990 AA101538 T29030 H27201 AW262526 A1610530 AA1 26840 AA126790 X92120 AW367868 8E299644 
BE299451 AA476561 BE300O44 AA134363 BE295222 AA307504 N42337 AA319098 N395Q2 AW984461 N57241 BE299049 
N86332 R51 156 AA085859 T75212 AA133S39 AA147129 AA1561 61 BE543953 BE538848 AA133676 BE299745 AA13505Q- 
AA218535 AW406401 AW411287 BE410528 C01410 NMJ04083 BE314959 AA836413 AAQ85862 AW024370 AA471059 
AW467508AA001025AJ828231 AA633221 T95517 AA147038 AA476447 AW027012 AW078627 BE513200 AI192297 
AA886279 AW081806 AA3161B5 AA010506 AG69929 W93139 A1682935 AA609555 AA378028 A1093877 AA999997 AA730698 
A1143923 AW575315 AA890550 AA494353 AW576601 AI7S6336 AA826130 AA609207 AJ539618 AI088539 AI089090 AA8255D5 
AA632978 AA01 5892 AW204713 AA156495 AA824613 AA133630 N29826 AA527476 A1533352 T27908 AA134364 AA133940 
AW043601 H37775 AA772375 AA057871 AA047888 AA054225 H86568 AA001511 H25718 AW189507 AA165589 AA054433 
H85549 AA165486 AA05B972 AA45491 1 AA464064 AA493802 AA428253 RB5508 AW302469 A161 1812 BE162582 F1 1073 
T95518 N26811 AI763929 H40669 AW611745 AI656803 R51042 R45276 AA528386 AA782S75 AW880218 AL138391 AA314536 
AW949338 AA149466 AA149552 AI346513 AA216776 BE349131 AW007654 AI141803 AA622688 AI185131 AW057635 
AA1 01539 AA627986 HZ7202 AI5S5847 W93084 AI973148 A12467B8 AW572108 AI469414 AA454835 AA612707 AA430746 
AHI84991 AA010400 AA856636 AA463928 AI248310 R07170 AA834033 012244 AI655670 AA054350 AA639480 AI702J67 
AI475389 

100643 3931.1 NM.005032 M34427 AA332167 AW409711 AL1 19718 BE297581 BE299855 AAQ82284 AA226855 AA149568 AW391953 
M22299 BE163594 AW847881 AW366993 BE142871 AW847885 AW604137 AW847753 AW847886 AW376442 U48350 
AW607478AA373011 AA334Q80 BE294177 AL121355 AA3Q2236 BE540666 BE170588 AA346884 BE541512 AA226818 
AAQ82001 AA366490 AW604122 AA205784 AW607791 BE168496 AA058497 T64373 BE1 65633 AW802804 AW847878 
AAW408 AA086397 AT751745 AA344103 AA034463 AJS06008 AA363580 AA379193 AJ332642 AH43569 W5274S W5Z754 
AA385532AA085967 F05943 AA363422 AA133444 AA133477 AA029541 N48387 N83348 AA376066 AA147671 W70187 
AA316255 BE174987 AA452776 AA089605 AL047776 BE162673 H39532 BE168406 AA357654 AA328728 AW813442 057844 
AW839748 AW839663 D57357 AA334536 AW268874 AW950788 AW409888 All 60544 057821 AW664382 D25884 AI7551 01 
AW130365 AI6C9094 AJ984064 AB06523 AA492516 AI755258 BE1 5721 0AA374 884 AI983923 AI831088 AA706501 AI754957 
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100673 21517.2 



108559 41469 9 
108569 118606J 
100700 17137.1 



A!688651AK)88623At336114N38752 756004 AA84520OAA858OTBE157397AVV0fi9347AAD45366AV»16918AWa^ 
AB55398 BE157396 AI751746 AJ375820 AA129935 W500Q2 N24781 AI805924 W5Q009 AA044283 AA121 161 AB39Z77 
AA301885 AW01S944 AA133445 AA1011C8 AA033559 W70GS0 AA617751 AB86261 AI023234 D82235 AAQ35846 AW754181 
D82093 D82100 AA1476S3 AA6Q Q2S D57884 AI753382 A1568Q50 A1146490 AW3O2280 Ai4330S1 AA329188AW572150 
AW166345 AB37981 AA778973 N67577 AA2272B7 AA838281 C06190 AUM6997 ACf7662 AI752979 AW627538 AH27171 
A1440461T64184AA845190A/\227111AA877394R60962A^ 

D57Q26 T17158 AA57B545 AWC85C32 BE14S939 AW8150S9 BE1 52843 BE149068 BE149036 AW815073 AW753691 BE14S04Q 
AW815CS5 BE152842 BE149072 AW753692 AW8150S BE152837 BH152849 BE152840 AW815070 BE152B29 BE152846 
B E1489 72 BE149042 BE149074 AW7536© BE152832 BH152841 BE14S082 BE149(50 AA347261 BE152852 BE152347 
R65797 FQ2189 AA483448 AB54410 AA865375 BE152836 BE152838 BE 152839 T17300 BE152844 BE152833 BE152834 
AAQ29542 A1567601 AD62353 BE1 62140 AI381 384 BE152851 D57038 057043 AI4 18363 AA133478 BE149051 BE149083 
BE152850 BE149052 BE149084 AA886686 BE14S064 BE149032 AA044093 AA129934 AA303976 BE 15721 1 AA187291 
BE152830AA046552 BE149047 BE149079 BE149Q33 BE149065 BE149044 BE149076 BE 149053 BE149085 BE149034 
BE149C68 BE149048 BE149Q80 BE149038 BE149070 BE149045 BE149077 
100670 22023J AA332178 BE259177 BE545625 TO105 S62076 M16424 NMJMQ520 BE244309 F13516 BE251567 BE514981 AL119537 
AA336739 BE261B01 AA278642 N32708 T77034 W24621 W42478 AW630382 AWB56214 AA134234 M13520 BE379212 
AA287459 BED19379 BE297192 BE162970 AW4QSB8 AW4CB322 BE272280 BE208703 BE304428 BE162807 BE162828 
BE162887 BE07B944 BE1630S BE162878 BE162909 BE162898 BE162791 BE162880 BE073563 BE153C86 BE162B96 
BE162770 BE073565 BE 162906 BE162913 BE 162347 BE162B03 BE262199 BE1 62811 BE080697 BE315C95 AW206024 
AI291054 BEQ87364 AL046839 AA3Q4422 AA847660 AA669376 BE392765 AI567793 AWQ26644 AW151258 AA996314 
AIB28660 AI571158T61941 AW103SJ3 AW172698AI923115AI823709T62167AW771381 AW151782 AI799284 AW242Z71 
AA128031 BE261 306 BE312241 AB74880 BE261057 AA630684 AA831305 A1139546 AW082447 AA916854 AA916855 TBS970 
AA599395 AA921 680 AB44674 AI041 320 AA424998 AB62999 W42543 T5 1260 AB62486 AI6SS3GS A1827925AI027381 
AJ027370 AEQ9049 AA782220 AB34014 AI279051 AI21771 1 AI674210AH93370 AI701683 123782 AI927545AI784291 
M128007At370630AI972736AA853763 N92379Ai916746AA639633AASa7603At4794^ 

AA563654 AA745291 AW089417 F10858 AE54227 R38108 AI668647 AA994088 AI74Q910 AW880973 AI739410 A1480346 
N78987 A1473892 BE1 62903 BE254430 BE260426 AA65C012 AWC06426 

AW403342 AW248986 BE561709 AA357312 BE311834 BE389496 BE34887 AW732696 BBQ47868 AI7Q2383 BE019155 - 
AI702367 BE408966 BE280458 BE313759 BB13492 BE5354W BE28G258 AC005263 NM_0O7165 L21990 AW73Z711 - 
AK64920 AVV249094 6E265365 AW507186 AW877346 BE005217 H2721 1 U46230 BE280O66 BE207O43 BE546782 AW24tt59 
AA085228AAQ85161 

AA082885AA114265AAD85398AA113184 

AA932794 BE540417 AW4098Q2 AW410765 BE296651 BE294197 BE164813 AW381886 AW381B08 AL048654 AW403058 
BE207228AA464654AW966967AA326831 BE407Z77BE408669 AA4765Z7AA1 15576 AA359697AA476357AA449939 
BE263719 AL045304 W21442 R28919 BE395990 AA252273 AB4681 2 BS38487 AA507160 W93950 N42025 AI088439 
AI134931 AA031524 AS872B7 AW470017 AA476423 AA464553 AW410766 AI569421 AA577476 AE48935 AI912371 
AW6 1 5574 AA824237 AA807746 AA827377 AA83Q268 AA4763C9 A1086424 AW409638 AAQ31553 AW451 901 AB20934 
AW050554 R49825 N30302 AA532541 W94004 N934Q2 AA1 15549 AA331 202 L25655 NMCQ5275 AA436745 AI122671 R49779 
R18S08 

100734 35197J AI807481 AB00404 AJ092260 BE3489S2 AI143675 AA7723S9 AA772398 AB68565 A13791 72 A1083781 A1871363 AA343793 
NM-000141 M87770 X52832 M55614 271929 W05259 AA548551 AW98743 BE081295 BE162251 F05643AI1 27918 H83199 
W07463 BB51725 R28404 AW206461 AW590506 AI885536 N69800 R93496 R25381 AA443093 AI143C83 A1284647 AI703144 
AA309032 AA309031 AW949426 AW949428 AI638387 AI638356 R77173 R38513 T89263 F01900 H37979 R7Q205 AA664355 
AA235910 AAQ33657 AI436212 N50463 AW33805 AH81876 AI421Q90 AW02081B A1559529 AIB87420 BE154202 AA889062 
AA723410 BE537575 AA236812 AC18552 AI264866 AC90617 AA424365 AA424505 AW073347 AA032183 AI142488 N55322 
AI884363 AI336070 N67307 AA608928 T94993 AW514184 AA724695 N66630 AB79638 AJ274671 AW62B470 AA235346 
AA687581 AI073906 AI2636Q2 AI8691 1 1 A1805693 AM23808 AJ076491 AJ374640 H82967 AA776567 AA256191 729856 
AA953586 AI140801 AI805484 AA984329 R93497 AI017114 AI263355 N81 103 AW41B776 D57474 AI918460 AA256152 
AI683268 AKW2628 AW196650 C00195 AI91 8567 T28903 N95383 R13671 T94939 AI275235 AA235751 T84335 

100739 2738J M59287 L29222 AGS 1890 BE244986 AI708332 

117040 46956 J AW970600 AA503323 H89218 AF086031 H89112 

100748 41861J XD6096X05826 

100760 1334J AW794626 M27126M27014 

100779 458J27 BE561958 BE561728 BE397612 BE514391 BE269037 BE514207 BE562381 BE514256 BE514403 BE514^0 BE397B32 

BE269598 BE559885 BE3S6881 BB60031 BE514199 BE560037 K560454 ^ 

100787 458J27 BE56195B BE561728 BE397612 BB14391 BE2B9037 BE514207 BE562381 BE514256 BE514403 BB14250 BE397832 
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335755 CH22_3122FGJB04_4JJNKJBM 
328544 cl7J« 

335768 CHZL3137FG_607JUJNK_EM 
305688 AA812728 
328552 QjTJb 

335774 CH23Jt43FG_607J0JJNK_E 

328557 c_7jw 

328558 qJTJb 

335777 CH22J146rej07J3JJNKJ= 
305697 AA814956 

335782 CH2?_3151FGJ09_4J.INK-^l 

335783 CH22_3152FG_610 JJJNK_EM 

328569 cJJa 

335787 CH2^156FGJ11JJ.INK_EM 

328570 oJJ» 

328581 cjja 

328582 cJJs 
328592 C_7J» 

337011 CH23J876FGJ27JL 
337023 CH2Z.4894FGJ33l.12_ 
337032 CHZL4910FG.438X 
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337089 CH22J957F6 
3#Q92 CH22JS01GFQ 465 1£_ 
3370© CH22J017FG~465J8 
337094 CH2L501ff(3L465_19" 
337097 CHZL5Q2BFG 47U " 
305700 AA815428 

3358CS Cffi^J517a=GJB15JBJJNK-Pyi 
335817 CH2?_3189FGL618_5JJN^Bfl 
32B807 qJjB 

335827 CHZLSaOOFG.620LlJJNieBWI 

335831 CH2?J204FG_.62D_5JJNK34 

335832 CH2^3aKFG_62D_6JLiNK_afl 
328620 cjjts 

328824 qJJjs 
328838 c_7Jb 

335863 OT2* J 3Z38FG_629JULINK_Sy| 
305782 AA844730 
305787 AA845035 
328662 q_7Jjs 

335895 CH2^3272FG_635JJJhBeai 

3371(H) CH2*_SQ31FG_472J3_ 

337114 CH2*_0O6QFG 494J7 

337121 CH2^.5096FGJ519_1_ 

337132 CH22_5112R3__52S_3 

337168 CH2Z5188FGJ352J8__ 

307085 AH60868 

337170 CH2^5190FGJ564 1 

337172 CH22_5192FG_565i 

307090 AH61Q24 

305803 AA846052 

305808 AA853958 

305818 AA854776 o 

335902 CH2*_3279FG_635J0JJNK^ 

335920 OB^3297FG_636_16JJNK£ 

305841 AA860348 

305867 AA864572 

335956 CH22J3334FG_647J3_UNKJDJ 
305877 AA865649 

335968 CH2*_3347FG_652JJJNKJ>J 
335971 CH22J3350FG^65Zj4_lihIK_PJ 
335975 CH2^_3354FGL652J9JJNK-W 
335980 aU2^3360FGJS3JJM<JU 
3 28768 cl7Jb 
328770 qJJb 

335993 CH2^3373FG_656JLUNKJDJ 

335998 CH2?_3379FG.656J6JJNKJ> 

335999 CH23JJ380FG.657J JJNKJW 
328791 Qj_hs 

337203 CH22J&6FGJ591JL 

337204 CHZL52B1FG_$95J_ 
307120 AI184343 

307140 AH85762 
307177 AJ188864 
305903 AA873085 
305925 AA877883 
328603 c_7 Jts 
330002 c16_p2 
328810 cJJs 
328620 cjjxs 
330021 d6_p2 
305SS7 AA886428 
328835 CJJB 
305971 AA886874 
328841 QjJe 
305984 AA887654 
328851 QJJb 
328859 Cjja 
330057 d7j2 
33)058 d7j2 
3069® AA889603 
307215 AH93189 
307262 AGQ2100 
307318 A1208577 
307380 AB22985 
307433 AE44895 
307437 AI245683 
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307556 AE81S51 



337645 OCL5960R3 JJNJCBtACOO 
337657 Ott?J976FG JW-EMACOO 
307574 AC83549 



307532 AB85739 

337685 Oi22.6020TC_JJhBC©ftAC00 
337685 C^6033FG_UNK^tAC00 
337697 CH2^6(H7FG JUNK.EMAC0O 
3070)6 AQ90006 



307629 
307642 
307643 
307646 
307703 
3 07728 
307752 
307799 
309005 
3 07864 
307877 
307899 
309108 



309181 
307904 
307912 
307918 
307954 
307981 



309206 
309233 
309245 
309247 
309275 
309324 
309333 
309343 
309351 
309403 



AI30Q246 
A1302103 
AI302124 
AJ3Q2236 
AJ318588 
AJ335557 
AS339447 
A051112 
AI884454 
AC67417 
AI368880 
AI380270 
AI925949 
AI949216 
AI951727 
AJ381019 
AB82224 
A1383496 
AM19692 
A1421059 
AM34166 
AI961962 
A1971416 
AB72447 
AIS72768 



325271 
325285 
325289 
309600 
309605 
309759 
309767 
309815 



302681 



309958 
309977 
309985 
302727 
302747 
304022 
304055 
304056 
304060 
304125 
304127 
304134 
304195 
302952 



304269 
304324 
304330 
304424 
30446 



AW015373 

AW025709 

AW028652 

AW057547 

AW082954 

AW151131 

AW172384 

CllJtS 

ClIjB 

Cl1J» 

A W1 82066 

AW182800 

AW268822 

AW271805 

AW292760 

AW298760 

31257.1 

31326J 

Ai MAAAAQ * 

nirvl'rtOQ 

AW451663 

AW452919 

32493J 

32B13J 

T02990 

R07994 

R08577 

T61464 

H40976 

H42981 

H54627 

N35382 

39444.1 

4119QJ 

AA069029 

AA137045 

AA157834 

AA293494 

AA421948 



X97550 X97552 
X85153T63701 



L10141 L10151 L10148 L03095 
ATO62275L03830 



AF103179 U82961 
AF054663AF124197R7O292 
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304467 
304480 



304487 
304559 
304575 
304605 
304612 
304623 
304635 
304667 
304674 
304675 



3046SS 
304707 
' 304731 
304734 
304745 
304746 
300009 
306012 
306053 
308081 
306090 
304811 
304813 
304817 
304833 
304841 
304887 
306137 
306180 
306163 
305193 
304918 



AA424703 
AA430373 
AA434076 
AA434241 
AA488050 
AA496437 
AA513225 
AA514207 
AA521331 
AA523976 
AA535602 
AA541735 
AA541740 
AA554263 
AA554756 
AA564846 
AA576085 
AA576428 
AA577771 
AA577793 
AA894560 



306200 
306220 
306221 
306300 
320789 
306351 
330435 



AA905312 

AA908472 

AA908609 

AA584361 

AA584540 

AA584712 

AA586504 

AA587541 

AA599355 

AA916176 

AA922503 

AA922622 

AA923457 

AA602697 

AA614308 

AA926816 

AA928363 

AA92B686 

AA937573 

252516L1 

AA961356 

41165J 



3304% 10605J34 



R78712M603646R7B713 

U63TO AW842139 X74956 U78550 AW840802 X74954 AW388241 AW842709 AF253321 X74955X74370AW363799 
BE073386 AJ791 962 AA587390 AW84C865 

BE259039 W29128 AW410299 X72390 BE246492 NM_QQ5243 X66899 A1909006 AW248151 AL031186AAO12966BE273549 
BE311429 BE2531Q2 Y07848 BE5381028E256863 BE261240 BE312156 BB618412 BE257322 BE620446AW806629 
^^AA325384BE256808BE251039BE257878BE 

BE256279 BE336683 BE252465 BE2512G8 AA380754 BE294942 AA380941 AA380999 BE297164 BE249995 BE294719 
BE295372 AI270673 BE305132 BE563752 BE295357 AB25421 BE263980 AA0575Q5 AA020915 BE266318 BE2Q6948 
AI474020 BE296420 BE297374 BE408545 BED19366 BE407372 BE266180 BE279437 R58233 T195S7 BE300738 AW381 179 
AA357571 AW361285 AA436908 AA301019 AA301Q22 N2Q2Q2 BE408777 BE548638 BE167415 AAO71260 BE0B842S 
BE280092 W23117T19568 R51681 AW4Q2216 W22784 BE185607 AM57224 BS44120 AL134874 S72620 AA375079 D51319 
^!^°. BE51468S AW853Q24 BE563744 AA300469 T07592 BB622190 BE272834 W21781 BE315450 BE542367 BE393120 
AAM8441 H55137BE562296BE622502BE395960AA329733AA332348AI7683l7AA4568tt 
U18018 BE621418 AK18790 AI949507 BE397693 AB85545 AI858854 AI355147 BE169Q28 S62138 AW732191 AA856891 
BE266pX714Z7BE268557ARB5890AW0 

185622^14249 BE264915AI638591 AB^385AW090025BE384754AI888689AW778800 AJ925273AA075797AW949130 
AV660275 AW438S97 A1587137 AI524121 AA806249 AW628247 AAB08241 A1244388 AI761 125 AW1 17672 AA911782 
M129250 M654447 H55291 BE258050 BE206162 W95867 AA857187 AI871378 AI660103 AW1 03827 AE220929 AW149949 
BE465561 AI302857AW168841 D82190 AW249814 AJ623432 AJ687358 AW951077 R51592 W60458 A1Q92B63 AW474693 
D12765 AI911646 D82208 D82187 AW074031 AB58527 AW338497 AA970893 AW072573 AA205364 AI858886 AA012830 
AW148763 A18S3056 AA548656 BE250325 AB16994 A1864005 BE046122 AI497746 C75340 R588S6 D82141 AW168240 
C19048AI741WD29465AI222365AA948288AI583522AV^^ 



AI6195roR3Z755AW075358AM32315AA457(E4AAa20 

AI8C6470 A* 991074 A* 802560 AA587 ^® 5 AA558714 AA968521 N877B0 AI538246 N71794 AV661738 AJ368903 
M3j^AI989445Al574962S75762^^ 

A1G56299 AA22Q990 AA101538 129030 H27201 AW232526 AI610530 AA126840 AA126790 X92120 AW367868 BE299644 
BE299451 AA476561 BE300044 AA134363 BE295222 AA307504 N42337 AA319098 N39502 AW964461 N57241 BE99049 
K86332 R5115S AA085859T75212 AA133939 AA147129 AA156161 BE543953 BE538848 AA133676 BE299745 AAlSrsn 
^8^AW4C64tt^^ 

AW46^AA0010KAI828231 AA633221 T95517AAM7038AA476447AW027012AVV078K7BE5132D0AI192297 
AA8^AVTO1B08AA31^^ 
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AA632978 AA015892 AW204713 AA156495 AA82481 3 AA133630 N29826 AA527476 AIS33352 T27908 AA134364 AA133940 
AWD43HH H37775 AA772375 AA057871 AA047888 AA054225 H86568 AA00151 1 H25718 AW189507 M1655B9 AA054433 
H85549 AA165486 AAD58972 AA454911 AA464064 AA4938Q2 AA428253 R85508AW302469AI611812BE16S82F11073 
T95518N26811Af783929H4CS69AW1t745Al 

AW949338 AA149466 AA149552 AI348513 AA216776 BE349131 AW007654 AI141B03AA622688 A1185131 AW057635 
AA101539 AA627986 H27202 A1536847 W93084 A/973148 A1246788 AW572108 AM69414 AA454835 AA612707 AA430746 
AKJ84991 AA010400 AA856636 AA463928 A1248310 RD7170 AA834033 D12244 A1655670 AA054350 AA639480 AJ702C57 
AJ475389 

330527 14658.-15 S77356 

330833 103550J AAD46804 AA046821 

330855 111881J AA070316AA079318 

332099 genbanUUV608983 AA608983 

332240 genbanJLN54803 N54803 
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TABLE 1-20B 

Table 1-20B, shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Tables 1-20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed 

Ptey. Uriquammtecoitttyqnflng to an Eos probeset 

Ret Sequence source. The7aTgftmin*erahtewta T)unhamLetai'refastotteoubCcafion 

enffledTheDNA 

sequence cfhunsnchnHiiosome 22." Dunham I. etaL, Nature (1999) 402:489495. 
Strand: Indicates DNAstrand fora which excnswerepnxDcfed 

NlposSon: Irefk^nudeoildepos^ofpfBdldBdexo^ 



Ptay 


Ref Strand 


Nt position 


332792 


LnjJLitaJii) 


1 of al Dfitc 


73381-73768 


332843 


Dunham, 




1142859-1143494 


332909 


Dunham, 


I of nl Dhre 
L Itl ell, rutS 


1346582-1346735 


332920 


LHJJlflaill) 


u cLai rlUS 


2007562-5007785 


332947 




Lcua. rHJB 


2431725-2432006 


332949 


uunnam, 


Letal Plus 


2436245-2436348 


332958 


Dunham, 


L f&JcA. Plus 


25161312516310 


332992 


niinmm 


L &J3L PUIS 


26999972701093 


332993 
333C04 


Hinham 
vuukilll. 


1 otal Pfatc 
u ULgL rWS 


2701550-2701685 


Dunham, 


1 Affil Dtiie 
I CuB. rlUS 


275905B275jH65 


333006 


UuiUuITl, 


L etaL Puis 


27628532762953 


333007 


OitnKam 

uuiujsan, 


L tA.sH Phis 


27635692763709 


333132 




l etai. rtus 


3358040-3358153 


333133 




I etaL Plus 


336005^3360195 


333139 


Dunham 


pf al Phie 


3369495-3369571 


333152 


Dunham, 


U Dial rlUS 


3812171-3612354 


333205 


Dunham 


I* cm r iuo 


3342727-3943009 


333221 


Dunham. 


U WiOi» r 1113 


3978070*3978187 




Dunham 


pfal Pfcie 
u PfcoL i HA 


3992229-3992386 


333245 


Dunham, 


Letal Phm 
l wucil riua 


4157587-4157668 


333248 


Dunham, 


• plqii nua 


4162041-4162139 


333261 


Dunham. 


1 rial PhK 


4336597-4337752 


333272 


Dunham,! 


pfal phis 


4381561-4382212 


333281 


Dunham, 


piaf pfus 


45082304506342 


333283 


Dunham,! 


etai Pius 


451422&4514360 


333288 


Dunham, 1 


.etaL Plus 


4516841-4516939 


333298 


Dunham,! 


.etaL Plus 


4581537-4581947 


333306 


Dunham, 1 


.etaL Plus 


539623^5396310 


333382 


Dunham, | 


.etai Plus 


490579&4905913 


333403 


Dunham, 1 


.etaL Plus 


49251404925256 


333420 


Dunham, 1 


.etaL Plus 


49543024954465 


333428 


Dunham, 1 


etaL Pftjs 


49738694974007 


333464 


v Dunham,! 


.etaL Plus 


5210762-5211300 


333465 


Dunham, | 


.etaL Plus 


5211385-5211858 


333488 


Dunham,! 


.etaL Plus 


5396233-5396310 


333515 


Dunham,! 


etaL Plus 


556429^5564851 




Dunham,!. 


etai Plus 


5586133-5586296 


333566 


Dunham, L 


etaL Plus 


5954226^954473 


333567 


Dunham, L 


etaL Plus 


59591394959515 


333571 


Dunham, L 


etaL Plus 


6007916-6008058 


333572 


Dunham, L 


etaL Plus 


60268964027189 


333576 


Dunham, L 


etaL Plus 


60903454090721 


333577 


Dunham, L 


etaL Pus 


61239504124281 


333580 


Dunham, L 


etaL Plus 


614293&6143145 


333587 


Dunham, L 


etaL Plus 


6250599-6250966 


333588 


Dunham, L 


etaL Phs 


6255445^255779 


333591 


Dunham, L 


etat Puis 


62858844286251 


333592 


Dunham, L 


etaL Plus 


6297731-6297976 


333593 


Dunham,!. 


etaL Plus 


6304132-6304428 


333594 


Dunham, L 


etai Pus 


6308990-6309450 


333599 


Dunham, L 


etaL Plus 


6337885-6338255 


333600 


Dunham, L 


etaL Pus 


635562*6355925 


333601 


Dunham, L 


etaL Plus 


636007^6360442 


3338)7 


Dunham, L 


etaL Rus 


6504431-6504690 


333608 


Dunham, L 


etaL Plus 


65108344511130 
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333819 


Dunham, L etai 


Plus 


6562799-6562926 




Dunham, LetaL 


Phis 


65845594584956 


333625 


Dunham, L etai 


Plus 


660302063)3310 


333626 


Dunham, LetaL 


Plus 


6614174-6614467 


333627 


Dunham, LetaL 


Plus 


66205846620903 


333628 


Dunham, L etaL 


Pte 


66290046629233 


333629 


Dunham, I. etaL 


Plus 


6636915-6637205 


333631 


Dunham, L etaL 


Plus 


66509046651011 


333632 


Dunham, LetaL 


Pte 


66515206651658 


333635 


Dunham, t. etaL 


Plus 


6663683-6663973 


333637 


Dunham, LetaL 


Plus 


66749666675134 


333640 


Dunham, I. etaL 


Plus 


66883506688624 


333642 


Dunham, LetaL 


Phis 


6708760-6709139 


333643 


Dunham, L etaL 


PblS 


67280536728343 


333646 


Dunham, L etaL 


Pte 


67391106739379 


333647 


Dunham, I. etaL 


Phis 


6772502-6772779 


333648 


Dunham, L etaL 


Plus 


67874656787782 


333650 


Dunham, LetaL 


Pte 


6796852-6797128 


333652 


Dunham, I. etaL 


Plus 


68094556809573 


333653 


Dunham, LetaL 


Plus 


68111306811392 


333654 


Dunham, L etaL 


Pis 


6816731-6816993 


333656 


Dunham, I. etaL 


Phis 


6822087-6822406 


333657 


Dunham, L etaL 


Plus 


68313696831445 


333658 


Dunham,!. etaL 


Pte 


6835282-6835474 


333668 


Dunham, I. etaL 


Plus 


70110097011223 


333670 


Dunham, I. etaL 


Plus 


7027945-7028181 


333680 


Dunham, L etaL 


Plus 


707173O7071794 


333682 


Dunham, LetaL 


Pte 


70766417076760 


333698 


Dunham, LetaL 


Pte 


7205279-7206383 


333710 


Dunham, L etaL 


Plus 


72303147230476 


333717 


Dunham, LetaL 


Pte 


73087147308815 


333727 


Dunham, LetaL 


Pte 


73732107373311 


333785 


Dunham, I. etaL 


Pte 


77753177775415 


333791 


Dunham, Letal 


Pte 


77959727796082 


333859 


Dunham, I. etaL 


Pte 


80412036041359 


333875 


Dunham, LetaL 


Pte 


81355056136179 


333879 


Dunham, LetaL 


Pte 


81469196147062 


333891 


Dunham, I. etaL 


Pte 


8156437-8156709 


333918 


Dunham, I. etaL 


Pte 


83071246307215 


333929 


Dunham, LetaL 


Pte 


84794866479580 


333932 


Dunham, LetaL 


Pte 


84891246489205 


333983 


Dunham, LetaL 


Pte 


8813593-8813668 


333987 


Dunham, L etaL 


Plus 


88242458824376 


333997 


Dunham, I. etaL 


Plus 


8866668-8867255 


334010 


Dunham, I. etaL 


Pte 


69966968998236 


334015 


Dunham, LetaL 


Pte 


9055452-60555% 


334017 


Dunham, LetaL 


Plus 


91395166139634 


334026 


Dunham, LetaL 


Pte 


91965406196681 


334030 


Dunham, L etaL 


Pte 


92884636288782 


334044 


Dunham, L etaL 


Plus 


93738966374065 


334047 


Dunham, LetaL 


Pte 


9428152-8428211 


334055 


Dunham, LetaL 


Pte 


98)2077-8662270 


334063 


Dunham, LetaL 


Pte 


9731991-9732085 


334066 


Dunham, 1. etaL 


Pte 


97395686739680 


334068 


Dunham, LetaL 


Pte 


97462706746477 


334076 


Dunham, LetaL 


Pte 


98016136801693 


334091 


Dunham, LetaL 


Plus 


9872327-8872527 


334106 


Dunham, LetaL 


Pte 


10261155-10261841 


334109 


Dunham, 1. etaL 


Pte 


10267679-10267864 


334111 


Dunham, L etaL 


Pte 


10279365-10279531 


334115 


Dunham, L etaL 


Plus 


10316414-10316608 


334118 


Dunham, LetaL 


Pte 


10344273-10344384 


334120 


Dunham, LetaL 


Pte 


10402389-10403196 


334135 


Dunham, LetaL 


Pte 


10457085-10457183 


334235 


Dunham, LetaL 


Pte 


12983601-12983703 


334239 


Dunham, Letal 


Pte 


13056569-13056693 


334244 


Dunham, LetaL 


Plus 


13159198-13159302 




uunnam, i. etai 


Ohm* 
PIUS 




33420) 


Dunham, LetaL 


Plus 


13223819-13223968 


334298 


Dunham, LetaL 


Pte 


13424763-13425914 


334342 


Dunham, LetaL 


Pte 


13646844-13646980 


334354 


Dunham, LetaL 


Pte 


13702598-13702747 


334430 


Dunham, L etai. 


Pte 


14269664-14270102 


334435 


Dunham, LetaL 


Pte 


14275597-14275689 


334451 


Dunham, LetaL 


Plus 


14315572-14315741 


334504 


Dunham, L etaL 


Plus 


14510206-14510398 
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334510 Dunham, Letai Plus 14522303-14522418 

334518 Dunham, Letai Rus 14630584-14630669 

33455 Dunham, LelaL Plus 14781 OSS-14781 137 

334528 Dunham, LelaL Rus 14787558-14787675 

334529 Dunham. Letai Pius 1478882-14788971 
334565 Dunham, L etaL Ptus 14989033-14989353 
334568 Dunham, Letai Plus 14992698-14983210 
334590 Dunham, L etaL Ptus 15033247-1033753 
334612 Dunham, LelaL Plus 15170470-15170535 
334626 Dunham, L etaL Pius 15299954-1530QOT 
334661 Dunham. L etaL Plus 15477716-15477788 
334664 Dunham, L etaL Ptus 15501941-15312119 
334666 Dunham, Letai Plus 15504203-15504279 
334675 Dunham, L etaL Plus 15516334-15516412 
334677 Dunham. LelaL Plus 15517449-15517560 
334719 Dunham, LelaL Plus 15778859-15779028 
334730 Dunham. L etaL Plus 15967830-15567934 
334797 Dunham, LelaL Plus 16383231-16383458 
334819 Dunham, Letai Plus 16758514-16758711 
334855 010413111,16181 Ptus 184Q2556-1B40Z735 
334900 Dunham, LelaL Plus 19315678-19315743 

334906 Dunham, LelaL Phis 19323493-19323590 

334907 Dunham, LelaL Plus 19338829-19336938 

334914 Dunham, LelaL Plus 19495150-19495275 

334915 Dunham, LelaL Plus 19505267-19506101 
334935 Dunham. LelaL .Plus 2010824749108373 
335019 Dunham, LelaL Ptus 20689525-2009582 
335036 Dunham. Letat Plus 2075525*20756375 
335049 Dunham. I. etaL Plus 20877184-20877309 
335081 Dunham. Letat Plus 2111387141113937 
335131 Dunham. LelaL Plus 21449834-21450003 
335157 Dunham. I. etaL Plus 2154330241544341 
335163 Dunham, L etaL Plus 2158350841583655 
335174 Dunham. LelaL Plus 2163130141631447 
335180 Dunham, L etaL Ptus 2164122641641603 

335188 Dunham, Letat Plus 21669116-21669328 

335189 Dunham. Letat Plus 2167340321673472 
335193 Dunham. LelaL Plus 2169220*21692362 

335200 Dunham, L etaL Plus 21743499-21743881 

335201 Dunham. LelaL Plus 2174524941745664 
335284 Dunham. LelaL Plus 2227283-22272712 
335311 Dunham, L etaL Plus 2250160242501676 
335320 Dunham, L etaL Ptus 2254213242542246 
335371 Dunham, Letat Plus 2284952942849808 
335377 Dunham. Letat Plus 2286990422870188 
335421 Dunham, LelaL Ptus 23250433-23250498 
335448 Dunham. I. etaL Plus 2347753243477831 
335451 Dunham, Letat Rus 2348061143480711 
335455 Dunham, Letat Plus 2348612743486216 
335468 Dunham, I. etaL Plus 2378724543787367 
335470 Dunham, Letai Ptus 2381287743813365 
335482 Dunham, Ldat Plus 2408795644088114 
335485 Dunham,!. etaL Plus 24098953-24099123 
335517 Dunham, Letai Ptus 2422646144226531 
335536 Dunham, Letat Plus 2449878344499762 
335565 Dunham, LelaL Plus 2488188644882470 
335585 Dunham, Letat Plus 2498824044988405 
335587 Dunham, Letat Plus 2499145744991618 
335593 Dunham, Letat Plus 2503725345037370 
335634 Dunham, Letat Plus 2516^3745164042 

335651 Dunham, L etaL Plus 2531756025317696 

335652 Dunham, Letat Plus 2532562545325883 
335667 Dunham, Letat Plus 25345735-2345856 
335687 Dunham, Letat Ptus 25445952-2446064 
335690 Dunham, letat Plus 25455442-2455625 

335692 Dunham, LelaL Plus 25468557-2468725 

335693 Dunham, L etaL Ptus 247136345471524 
335700 Dunham, Letat Plus 2512222-2512890 

335720 Dunham, t etaL Ptus 259654245596646 

335721 Dunham, LelaL Plus 25599118-25599228 
335733 Dunham, I. etat Plus 25686488-25686626 
335739 Dunham. Letai Plus 2569855045698826 
335745 Dunham, Letai Ptus 2571816445718332 
335747 Dunham, Letai Ptus 25723304^723494 
335783 Dunham, Letat Ptus 2594188245942040 
3359® Dunham. Letai Plus 277438434774403 
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335971 


Dunham, LetaL 


335875 


Dunham, L etaL 


335993 


Dunham, L eiaL 


335998 


Dunham, LetaL 


330)11 


Dunham, LetaL 


336035 


Dunham, LetaL 


336049 


Dunham, LetaL 


336055 


Dunham, LetaL 


336072 


Dunham, LetaL 


336075 


Dunham, 1. etaL 


336081 


Dunham, LetaL 


336083 


Dunham, LetaL 


336084 


Dunham, LetaL 


336086 


Dunham, LetaL 


336087 


Dunham, LetaL 


336088 


Dunham, LetaL 


336089 


Dunham, L etaL 


336090 


Dunham, 1. etaL 


336091 


Dunham,Letai 


336092 


Dunham, LetaL 


336094 


Dunham, LetaL 


336102 


Dunham, LetaJ. 


336124 


Dunham, LetaL 


336125 


Dunham, 1. etaL 


336127 


Dunham, LetaL 


336128 


Dunham, LetaL 


336171 


Dunham, LetaL 


336214 


Dunham, LetaL 


336306 


Dunham, LetaL 


3363.16 


Dunham, LetaL 


336333 


Dunham, L etaL 


336387 


Dunham, LetaL 


336442 


Dunham, LetaL 


336481 


Dunham, LetaL 


336489 


Dunham, LetaL 


336498 


Dunham, LetaL 


336502 


Dunham, LetaL 


336516 


Dunham, LetaL 


336548 


Dunham, LetaL 


336590 


Dunham, LetaL 


336593 


Dunham, LetaL 


336606 


Dunham, LetaL 


336623 


Dunham, LetaL 


336632 


Dunham, LetaL 


336633 


Dunham, LetaL. 


336634 


Dunham, LetaL 


336635 


Dunham, LetaL 


33837 


Dunham, LetaL 


336638 


Dunham, LetaL 


336654 


Dunham, LetaL 


336665 


Dunham, LetaL 


336695 


Dunham, L etaL 


336705 


Dunham, LetaL 


336751 


Dunham, LetaL 


336753 


Dunham, LetaL 


336757 


Dunham, LetaL 


336759 


Dunham, LetaL 


336808 


Dunham, LetaL 


336813 


Dunham, L etaL 


338827 


Dunham, tetaL 


336833 


Dunham, LetaL 


336836 


Dunham, LetaL 


336878 


Dunham, LetaL 


336898 


Dunham, 1. etaL 


336917 


Dunham, LetaL 


336924 


Dunham, LetaL 


336934 


Dunham, LetaL 


336958 


Dunham, LetaL 


3369S8 


Dunham, LetaL 


336985 


Dunham, LetaL 


337032 


Dunham, LetaL 


337121 


Dunham, LetaL 


337132 


Dunham, LetaL 


337168 


Dunham, LetaL 


337170 


Dunham,LetaL 


337172 


Dunham, LetaL 



Plus 


2775280847753017 


Plus 


2780132147801391 


Pha 


2800639648006571 


Plus 


28026247-28026356 


Plus 


2857076048570900 


Plus 


2901674849017410 


Plus 


2912445149124606 


Pius 


2916635249166573 


Plus 


2926940949269575 


Phis 


2934393849344050 


Plus 


29372270-29372407 


Plus 


29379738-29379881 


Plus 


2938066449380792 


Pfas 


29385594-29385713 


Plus 


29387122*29387253 


Plus 


2938900149389124 


Plus 


29401421-29401625 


Plus 


29413020-29413162 


Phis 


2941548649415617 


Plus 


2943881649438948 


Plus 


2956793449568099 


Plus 


2968150049681687 


Plus 


30053441-30053500 


Plus 


30056541-30056622 


Plus 


30058871W0058029 


Plus 


3005929440059367 


Plus 


30223877-30224048 


Phis 


3056198&30562044 


Plus 


3317294943173281 


Plus 


3333877643338919 


Plus 


33673947-33677345 


Plus 


3401393944014124 


Plus 


34187937-34188061 


Plus 


34231611-34231720 


Plus 


34239511-34239619 


Plus 


3426734044267424 


Plus 


3426895344269083 


Phis 


34296068-34296249 


Plus 


34353881-34354826 


Plus 


2569426-2570275 


Phis 


4617967-4618783 


Phis 


16170215-16170415 


Plus 


167287-167616 


Plus 




Plus 


985591-886221 


Phis 


986296-986670 


Plus 


98790W88364 


Plus 


9892764S0813 


Plus 


99190&493240 


Plus 


1588782-1588858 


Plus 


2007241-2007363 


Plus 


24646094464739 


Plus 


281766&4817723 


Phis 


44996294499739 


Phis 


45095094509601 


Phis 


45411674541466 


Plus 


45605264560699 


Plus 


625163*6251761 


Phis 


6436416-6436719 


Plus 


6773762-8774057 


Plus 


685650*6856634 


Plus 


7077262-7077326 


Plus 


9200300-9200399 


Plus 


10138365-10138458 


Plus 


11228329-11228403 


Plus 


11525273-11525527 


Phis 


11854506-11854792 


Plus 


13203550-13203973 


Plus 


13660250-13660530 


Phis 


14785058-14785225 


Plus 


16949229-16949306 


Plus 


2205028942050370 


Phis 


2225523442255437 


Plus 


2350359643504240 


Plus 


2353376343534008 


Plus 


2360835643608526 
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337218 


Dunham, LeLaL Plus 


337219 


Durham, Letai Plus 


337238 


Dunham, L etaL Pius 


337238 


Dunham LeLaL Pfas 


337259 


Lfanhany Letai Plus 


337291 


Dunham. L etaL Plus 


337298 


Dunham, L elal Pius 


337304 


Dunham. L etaL Phis 


337333 


Dunham. L eLaL Pfcm 


337338 


Dm ham 1 otal Phs 


337345 


Duiham LeLaL Phis 


337355 


Dunham 1 eLaL Phis 


337383 


Dunham. LfiLaL Phis 


337396 


Dunham 1 pt a! Phn 


337402 


Dunham. LeLaL Phis 


337403 


Dunham. L eLaL Phis 


337425 


Dunham. 1 eLaL Plus 


337426 


Dtmham LeLaL Phis 


337427 


Dunham L eLaL Phis 
wuiiiHiiif tm cm run 


337435 


Dunham 1 ofal Phn 


337484 


Dunham 1 ofal Pfatc 


337489 


Dunham 1 ofal Phn 


337503 


Dunham 1 total Mr 

L/Ulltaiil. I* BUUi i ltd 


337504 


Dunham 1 rial Phm 


33/522 


Dunham 1 ofal Phie 


337544 


Dunham 1 ofal Pkn 

ULU Brail, U BLoL rUO 


337548 


Dunham 1 of nl Pfcre 


337562 


Dunham 1 ofal Pfin 


337564 


Dunham 1 ofol PIir 
uuj uioiij. u cuol run 


337586 


Dunham 1 ofal PJim 


337628 


DunhanL I ofal PfVre 


337657 


Dinham f ofal Plin 


337685 


Ditnham 1 of si Pfcic 


337695 


Dunham 1 of al DIin 


337697 


Dunham 1 of si Dhta 


337 756 


Dunham 1 of al Dhie 


337781 


Him ham 1 of at Pbte 


337797 


Dunham 1 of at Dfcn 


337835 


Dunham 1 of at Dhie 


337843 


Dunham 1 of al Ph ra 


337865 


Dtnham I of al Ptirn 


337870 


Dunham 1 of al Phm 


337904 


Dunham I of at Pbie 


337918 


Dim ham 1 of a! Dhie 


337930 


Dunham 1 at al Phis 


337947 


Dunham. I of al Pbn 


337951 


Dunham 1 of al Phm 


337954 


Dunham 1 ofaJ Phie 


337964 


Dinham 1 of al Phia 


338043 


Dunham 1 of at Phn 


338054 


Dunham 1 of al Phm 
vuiuhuii, u cmi. rW9 


338062 


Dunham 1 of al Mr 


338080 


Dunham ! of al Phm 

UtllBKUIL, U cLOL rlUo 


338090 


Dunham 1 of a! Phm 


338096 


- Dunham 1 of al Phn 


338120 


Dunham 1 pf al Phis 


338123 


Dunham 1 at at Phs 


338124 


Dunham l fit Phis 


338177 


Dunham. L of al Phis 


338239 


Dunham L elaL Phis 


338278 


Dunham. 1 etaL Phis 


338280 


Dunham | eLaL Plus 


338326 


Dunham. L eLaL Phis 


338338 


Dunham. L eLaL Phis 


338344 


Dunham. L of at Phis 


338411 


Dunham. L eLaL Phis 


338412 


Dunham i ofai Plus 


338491 


Dunham, LeLaL Plus 


338514 


Dunham, Letai Plus 


338516 


Dunham, LeLaL Phs 


338521 


Dunham, L etaL Plus 




Dunham, L etaL Plus 


338526 


Dunham, L etaL Plus 


338577 


Dunham, LetaL Plus 


338529 


Dunham, Letai Plus 


338579 


Dunham, Letai Plus 



281264^26126585 

261296^26130053 

2711431827114571 

2714146527141776 

27674026-27674142 

2882330828823412 

29BB509290Q2 808 

29131691-29131823 

3028138^30281673 

304271^2-304277^ 

23649387^30649487 

3033ffl38^K33O42 

30942712-305^840 

31^5902^1586067 

31691134^16914^ 

316923 19-31692611 

32072 241-32072334 

32)76190-32076349 

32078958-32079076 

3225571&32255844 

33238033-33238176 

3329572^33295872 

33385583~33385857 

33386053^3386238 

33963188-33963979 

34^412^342684% 

34472882^4472957 

278105278281 

285643285788 

952102-952250 

'20138382014032 

28448512845055 

3547161-3547245 

3609670-3509808 

3644971-3645195 

3975015-3975172 

41160494116265 

4325712-4325811 

47922994792423 

498496 84985082 

5377705-5377861 

544251(^442636 

56858195686012 

6031241-6031356 

639356&£393692 

667564 3-6675781 

6766321-6766382 

68314838831620 

7032720-7032802 

82221384222430 

8429258*429382 

85788524579012 

9212114-9212220 

94886194488858 

9898397-9698469 

10765673-10765820 

10815824-10815886 

1080)311-1080)471 

12781530-12781724 

14669918-14670016 

16168714-16168840 

16171884-16172039 

17358806-17356964 

17453324-17453448 

17780512-17780623 

19299273-19299522 
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2116843021168556 
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21552237-21553280 

21635616-21635867 
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nimSam t olal 
UUnirain, 1. 6LoL 


338648 * 


ntmkwn 1 Mai 

uunnam, lguu. 


338679 


uunnam, l bul 


338703 


uiiinam, LeLat 


338705 


uunnam, loisl 


330747 


uunnam, Leiai. 


490770 


uunnam, l etaL 


338809 


uuraiam, L etaL 


338830 


uunnam! l&jbl 


338880 


uunnam, Letst 


3388%' 


nimkxM 1 Mini 

uunnam, LetaL 


338896 


uunnam, L etai 


338911 


uunnam, LetaL 


338825 


Dunham, LetaL 


3389S2 


Dunham, LetaL 


338359 


uunnam, Letat 


338970 


uunnam, L etaL 


339081 


uunnam, i.eiaL 


339116 


uunnam, LetaL 


339164 


Dunham, LetaL 


339246 


uunnam, L etai. 


339271 


uunnam, LetaL 


339304 


uunnam, LetaL 


339309 


Dunham, LetaL 


339312 


Dunham, 1. etaL 


339356 


Dunham, t etaL 


339357 


Dunham, LetaL 


339360 


Dunham, LetaL 


339384 


Dunham, LetaL 


339413 


Dunham, 1. etaL 


339419 


Dunham, LetaL 


332854 


Dunham, LetaL 


332859 


Dunham, LetaL 


332896 


Dunham, LetaL 


332995 


Dunham, L etaL 


333014 


Dunham, LetaL 


333043 


Dunham, LetaL 


333054 


Dunham, LetaL 


333069 


Dunham, LetaL 


333074 


Dunham, LetaL 


333101 


Dunham, LetaL 


333140 


Dunham, LetaL 


333142 


Dunham, LetaL 


333175 


Dunham, LetaL 


333216 


ft. - i f _,» 

Dunham, LetaL 


333219 


Dunham, LetaL 




Dunham, LetaL 


333240 


Dunham, LetaL 


333242 


Dunham, L etat 


333251 


uunnam, LetaL 


333?fi2 


Dunham, LetaL 


333257 


Dunham, 1. etaL 


333258 


Dunham, LetaL 


333271 


Dunham, LetaL 


333294 


^ > ■ _* _ f 

Dunham, LetaL 


333298 


Dunham, LetaL 


333312 


Dunham, 1. etaL 


333313 


Dunham, LetaL 


333323 


uunnam, LetaL 


333329 


uunnam, LetaL 


333339 


nimlMm 1 Mai 

uunnam, i. etat 


333346 


Dunham, LetaL 


333355 


uunnam, l etaL 




uunnam, I. etaL 


0004Z3 


uunnam,!, etaL 




uunnam, i. etai. 




uunnam, LetaL 


333466 


EJIMiMfIL I OT3L 


333495 


Dunham, LetaL 


3334% 


Dunham, I. etai 


333541 


Dunham, LetaL 


333545 


Dunham, LetaL 


333561 


Dunham, LetaL 


333568 


Dunham, LetaL- 


333581 


Dunham, LetaL 


333582 


Dunham,LetaL 
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333&5 


DunhanUetaL Minus 


333713 


DunhanUetaL Minus 


333730 


Dunham, LeLaL Mhus 


333737 


DunhanUetaL Minus 


333762 


Dunham, LetaL Minus 


333815 


Dunham, L eLaL Minus 




DunhanUetaL Minus 


333903 


Dunham, Lata! Mhus 


333310 


Dunham, L etai Minus 


333944 


DunhanUetaL Minus 


333965 


Dunham, L etai. Mhus 


334035 


Dunham, LetaL Minus 


334125 


Dunham, L etai Mnus 


334137 


Dunham, LetaL Mms 


334172 


DunhanUetaL Minus 


334175 


Dunham, L etai Minus 


334191 


Dunham, LetaL Mnus 


334196 


Dunham, LetaL Minus 


334198 


Dunham, LetaL Minus 


334215 


Dunham, LetaL Minus 


334237 


DunhanUetaL Minus 


334254 


Dunham, LetaL Minis 


334304 


Dunham, LetaL Mhus 


334358 


DunhanUetaL Minus 


334396 


DunhanUetaL Minus 


334399 


Dunham, LetaL Mnus 


334418 


Dunham, LetaL Minus 


334470 


Dunham, LetaL Mnus 


334474 


Dunham, LetaL Mnus 


334476 


Dunham, LetaL Minus 


334496 


Dunham, LetaL Mnus 


334540 


Dunham, LetaL Mnus 


334548 


Dunham, LetaL Mnus 


334560 


Dunham, LetaL Mnus 


334591 


DunhanUetaL Mhus 


334593 


Dunham, LetaL Mnus 


334625 


Dunham, LetaL Mnus 


334650 


Dunham, LetaL Mnus 


334682 


Dunham, LetaL Mhus 


334690 


DunhanUetaL Mnus 


334691 


Dunham, L etai Mnus 


334745 


Dunham, LetaL Mhus 


334752 


Dunham, LetaL Mnus 


334759 


Dunham, LetaL Mnus 


334760 


Dunham, LetaL Minus 


334764 


Dunham, LetaL Mhus 


334766 


Dunham, LetaL Mnus 


334779 


Dunham, LetaL Mnus 


334782 


Dunham, LetaL Mnus 


334783 


Dunham, LetaL Mnus 


334784 


DunhanUetaL Mnus 


334785 


Dunham, LetaL Mhus 


334788 


Dunham, LetaL Mnus 


334793 


Dunham, LetaL Mnus 


334804 


Dunham, LetaL Mnus 


334823 


Dunham, LetaL Mhus 


334827 


Dunham, LetaL Mhus 


334836 


Dunham, LetaL Mhus 


334642 


Dunham, LetaL Mnus 


334946 


DunhanUetaL Mnus 


334948 


Dunham, LetaL Mnus 


334955 


Dunham, LetaL Mhus 


334958 


Dunham, LetaL Mnus 


334966 


Dunham, L etaL Mhus 


334969 


Dunham, LetaL Mnus 


334970 


Dunham, LetaL Mhus 


335000 


Dunham, LetaL Mhus 


335MB 


Dunham, LetaL Mnus 


335009 


Dunham, LetaL Mnus 


335068 


Dunham, LetaL Mnus 


335090 


Dunham, LetaL Mhus 


335092 


DunhanUetaL Mnus 


335108 


DunhanUetaL Mnus 


335115 


Dunham, LetaL Mhus 


335116 


DunhanUetaL Mnus 


335217 


Dunham, LetaL Mhus 



69754716975215 
7289490-7289337 
7489326-7483007 
75519817551842 
76794657679343 
78719147871764 
81938454193735 
82169563216810 
8230221-8230116 
6557051-8556936 
86274844826258 

10433521-10432612 

10461063-10460708 

11644142-11644008 

11668659-11668597 

11932215-11932091 

11966322-11966182 

12032384-12032146 

12670395-12670077 

12994394-12993954 

13199268-13138999 

13455315-13455219 

13724372-13724201 

14164482-14164340 

14166289-14186163 

14237235-14237028 

14389581-14389442 

14391920-14391809 

14394777-14394589 

14480743-14480669 

14832385-14832224 

14837304-14837150 

14980648-14980485 

15035270-15035057 

15036374-15036242 

15244936-15243666 

15371251-15371178 

15520806-15520668 

15603055-15602918 

15615920-15611418 

16049960-16049653 

16118059-16117848 

16134427-16134207 

16136658-16136578 

16151208-16151104 

16154653-16154529 

16276861-16276815 

16291693-16291574 

16293336-16293226 

16294548-16294360 

16295583-16295408 

16301237-16301081 

16330748-16330681 

16428342-16428208 

16851360-16851189 

16905877-16905684 

17215581-17215413 

17464352-17464181 

2014051440140410 

20141727-20141583 

2015175340151684 

20156046-20155932 

20169171-20163079 

2018817&201 88020 

20195886-20195554 

2045540320455348 

2047914340479008 

20602011-20601951 

2095560&209S458 

2119297941190943 

21298638-21298571 

21324221-21324030 

2138825041388146 

2138857341388414 

2180162241801195 
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335234 


Dunham,LetaL 


Minus 


335236 


Dunham, LetaL 


Minus 


335247 


Dunham, L etaL 


Minus 


335262 


Dunham, L etal 


Minus 


335265 


Dunham, LetaL 


Minus 


335266 


Dunham, LetaL 


Minus 


335281 


Dunham, 1. etaL 


Minus 


335331 


Dunham, LetaL 


Minus 


335339 


Dunham, L etaL 


Minus 


335340 


Dunham, LetaL 


Mnus 


335344 


Dunham, LetaL 


Minus 


335348 


Dunham, L etaL 


Minus 


335349 


Dunham, LetaL 


Minus 


335523 


Dunham, LetaL 


Minus 


335527 


Dunham, LetaL 


Minus 


335566 


Dunham, LetaL 


Minus 


335606 


Dunham, LetaL 


Wnus 


335750 


Dunham, LetaL 


Minus 


335755 


Dunham, LetaL 


Minus 


335768 


Dunham, LetaL 


NBnus 


335774 


Dunham, LetaL 


Minus 


335777 


Dunham, LetaL 


Minus 


335782 


Dunham, LetaL 


Minus 


335787 


Dunham, LetaL 


Minus 


335808 


Dunham, LetaL 


Minus 


335817 


Dunham, LetaL 


Minus 


335827 


Dunham, LetaL 


Minus 


335831 


Dunham, LetaL 


Minus 


335832 


Dunham, L etaL 


Mtaus 


335863 


Dunham. LetaL 


Minus 


335895 


Dunham, LetaL 


Minus 


335902 


Dunham, LetaL 


Minus 


335920 


Dunham, I. etaL 


Mams 


335956 


Dunham, LetaL 


Minus 


335980 


Dunham, LetaL 


Minus 


335999 


Dunham, LetaL 


Minus 


338000 


Dunham, 1. etaL 


Aflnus 


336029 


Dunham, LetaL 


Minus 


336142 


Dunham, LetaL 


Minus 


336149 


Dunham, LetaL 


Minus 


336161 


Dunham, LetaL 


Rfinus 


336177 


Dunham, LetaL 


Minus 


336198 


Dunham, L etaL 


Minus 


336199 


Dunham, LetaL 


Minus 


336200 


Dunham, L etaL 


Minus 


336202 


Dunham, LetaL 


Minus 


338203 


Dunham, LetaL 


Minus 


336227 


Dunham, LetaL 


Minus 


336231 


Dunham, LetaL 


Minus 


336232 


Dunham, LetaL 


Minus 


336238 


Dunham, LetaL 


Minus 


336243 


Dunham, LetaL 


Minus 


336246 


Dunham, LetaL 


Mnus 


336279 


Dunham, LetaL 


Mktus 


336280 


Dunham, LetaL 


Minus 


336294 


Dunham, L etaL 


Minus 


336295 


Dunham, LetaL 


Minus 


336328 


Dunham, LetaL 


Afinus 


336347 


Dunham, 1. etaL 


Mnus 


336351 


' Dunham, LetaL 


Minus 


336397 


Dunham, LetaL 


Minus 


336400 


Dunham, LetaL 


Minus 


336402 


Dunham, 1. etaL 


Mnus 


336406 


Dunham, LetaL 


Minus 


336417 


Dunham, LetaL 


Minus 


336455 


Dunham, LetaL 


Minus 


336463 


Dunham, 1. etaL 


Mhus 


336479 


Dunham. LetaL 


Minus 


336510 


Dunham, LetaL 


Minus 


336538 


Dunham, 1. etaL 


Pfinus 


336584 


Dunham, L etaL 


Minus 


336599 


Dunham, LetaL 


Minus 


336615 


Dunham, LetaL 


Minus 


336616 


Dunham, LetaL 


Minus 


336618 


Dunham, LetaL 


Minus 


336645 


Dunham. LetaL 


Minus 



21904287-21904080 

2191501621914870 

21944287-21944209 

22065821-22065630 

22102127-22102)00 

22103492-22108411 

22237614-22237503 

2255682322556708 

225692KW2569045 

2270500-22570342 

22600566-22600498 

22646377-22646122 

22661861-22661271 

24235282-24235175 

2424151924241370 

2489222-24891998 

25056719-25055010 

25732501-2731972 

2576380625763747 

2581982125819690 

2588373325883572 

2588577025885599 

2590857825908440 

2594702225946835 

2621747526217146 

2632187526321750 

26380557-26380472 

2632152626321349 

2638370126383563 

2669301726692763 

2697530726975239 

2698549126985358 

270 34927- 27034811 

2765372927653635 

27846751-27846614 

2803398628033848 

28105547-28105354 

28732042-28731966 

30135157-30133)54 

3014818&30148059 

30169370-30169185 

30306638-30303284 

3045966630459460 

3046124530461040 

30462971-30462895 

30 46893 330468781 

30470796-30470653 

3090201430901946 

3 0967656 30967471 

3097770730977645 

3116008331159959 

3140223731402104 

3142566931425253 

3210320332102860 

3210358632103497 

3294699732946890 

3295029132950162 

3358446133584201 

3384321833843104 

3386589033865681 

3402150434021389 

3402343734023298 

3402409034023981 

3403077434030564 

3404777134047675 

3420915534209018 

3421223634211968 

3421822434218139 

3427704634276928 

3432927034329189 

3451656634516468 

11736500-11734418 

2602062226016546 

2602102726020848 

2579925612 

1351268-1351168 



212- 



WO 02/068677 



PCT/US02/06001 



336653 


Durinarn, I.eiaL Minus 


336657 


mm ■ - *- ft A - 1 ft my 

- Dunham, L etal Minus 


3K717 


Dunham, letal Mnus 


336772 


Dunham. LeLaL Minus 


336795 


Dunham, L etaL Minus 


336796 


Dunham, Letal Mnus 


33354 


Dunham, L etaL Mnus 


33689 


DunhanUetat Minus 


336882 


Dunham, L etaL Minus 


336863 


Dunham, L etaL Minus 


336883 


Dunham,Letal Mnus 


336908 


Dunham, Letat bfins 


336927 


Dunham, 1. etaL Minus 


336929 


Dutoam, Letal Mnus 


336978 


Dunham, L etaL Minus 


336981 
336984 


Dunham, Letat Minus 


Dunham, 1. etaL Minus 


336994 


0011)13111,16131 Mfous 


336999 


Dunham, L etaL Mnus 


337011 


Dunham, Letat Minus 


337023 


Dunham,!. etaL Minus 


337069 


Dunham, L etaL Mmus 


337032 


Dunham, L etaL Minus 


337CS3 


Dunham, Letal Minus 


337094 


Dunham, L etaL Mnus 


337097 


Dunham. Letat Mnus 


337100 


Dunham, Letal Minus 


337114 


Dunham, L etaL Minus 


337203 


Dunham, Letal Mnus 


337204 


Dunham, Letal Minus 


337225 


Dunham, letal Mnus 


337244 


Dunham, Letal Minus 


337279 


Dunham, Letat Minus 


337289 


Dunham, 1. etaL Minus 


337316 


Dunham, Letal Minus 


337431 


Dunham, Letal Minus 


337432 


Dunham, Letat Mnus 


337445 


Dunham, Letal Minus 


337451 


Dunham, Letal Minus 


337452 


Dunham, Letat Mnus 


337455 


Dunham, Letal Mnus 


337456 


Dunham, 1. etaL Mnus 


337513 


Dunham, LetaL Minus 


337514 


Dunham,! etaL Minus 


337517 


Dunham, letat Mnus 


337590 


Dunham, Letat Minus 


337592 


Dunham, Letal Minus 


337604 


Dunham, L etaL Affirms 


337606 


Dunham, Letat Mnus 


337612 


Dunham, letal bOus 


337645 


Dunham, letat Minus 


337704 


Dunham, letal Mnus 


337706 


Dunham. Letal Mfaus 


337760 


Dunham,! etaL Mnus 


337768 


Dunham, Letat Mnus 


337771 


Dunham, letal Minus 


337778 


Dunham, Letal Mnus 


337787 


Dunham, Letal Mnus 


337816 


Dunham, Letal Mnus 


337821 


Dunham, Letal Mnus 


337848 


Dunham, Letal Minus 


337911 


Dunham, Letal Mnus 


337913 


Dunham, Letat Mnus 


337974 


Dunham, letal Mnus 


337984 


Dunham, Letat Mnus 


336069 


Dunham, Letal Minus 


338087 


Dunham, Letat Moms 


000110 


Dunham, letal Minus 


338132 


Dunham, Letal Mows 


338140 


Dunham. letal Mnus 


338151 


Dunham. Letal Mnus 


338221 


Dunham, letal Minus 


338223 
338235 


Dunham, letal Mnus 


Dunham, Letal Mnus 


338271 


Dunham, letal Mnus 


338285 


Dunham, Letat Mnus 



108327-15K216 

1716497-1705415 

3297417-3297352 

51299396129785 

56815806681460 

5HJ3253468S016 

7884839-7884767 

82009484200789 

839459343943)9 

839667*4396425 

9322767-8322554 

10791347-10791256 

115481 11-11547937 

116G3O4O-11602908 

14241223-14241030 

14478638-14478472 

147S450-147S309 

15104752-15104656 

15389579-15388897 

16106423-16106080 

16346174-16346065 

19014379-19014222 

20140132-20133380 

2014569^-20145570 

2)146915-20146778 

2049864820497686 

20543830-2)543751 

21328259-21328050 

2534928&25349156 

2541902425418931 

26607638-26607468 

27362048-27361952 

28430185-28430033 

28756245-28756163 

29657129-&656997 

3211052^32110475 

3219913442198922 

32292380-32292228 

3240868042408576 

3241518742415117 

3243451742434425 

3243888742438828 

3347869243478562 

3348440843484339 

3379556843795385 

982397482246 

1007791-1007634 

1329838-1329759 

1346786-1346617 

1570235-1570142 

51414624141329 

36586704658545 

36720584671922 

40083884008037 

4 050134 4049973 

4060/97-4060707 

40873014087157 

41350974135025 

4523203-4523090 

453818^4538075 

50539644053829 

58665044866154 

614984*6149786 

7153401-7153085 

7282160-7282020 

67286554726580 

93620674361678 

10481242-10481049 

10989617-10989530 

11237977-11237876 

11519829-11519501 

14183649-14183568 

14287001-14286830 

14484254-14484179 

16048151-16047985 

16273B0-16274895 
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338308 


Dunham, L etaL 




Dunham, L etaL 


338368 


Dunham. LataL 


338381 


DwAam. L fitai 


338431 


dmham. LataL 


338450 


nimhafTL L etiiL 

WUIUIOJI*, la wLOfc 


338451 


Dunham. L etaL 


338488 


Dunham. 1. sLaL 


338508 


Dunham. L elaL 


338576 


fhnham I pit al 


338591 


Dunham. 1 etaL 


338593 


Dunham L eiaL 

lafUIUaallll la DUU> 


338594 


UUIIlMUil| la BUlk 


338668 


UUiHtdjn, laCLOla 


OOOQOO 


LoJIUkUII, kCUL 




Dunham 1 pf al 




Dunham 1 plat 

L/Ulllullll, !• OKU. 


338829 


Dinham 1 at al 
LSI! U Colli, la sWL 


338843 






Dimham 1 of al 


wwWIV 


Dimhnm 1 pt al 




Dimham 1 of al 


009UI9 


r\i tnhgtw 1 At al 

uunnainy t. cull 


339032 


uunnani, i. ta m. 


OtXJUOO 


UUIIfloJii, L uLaL 


339037 


uunnani, u cuu. 




Dinham 1 of al 


338071 


Dunham, L ataL 


339127 


Dunham, I. etaL 


339130 


Dunham, 1. etaL 


OOSIOI 


■JUnflanit 1* VLOL 


OOXMQQ 
0091 00 


uifuiarn, l guil 




Dunham, L etaL 


003190 


lAUUnJil, L Ha, 




Dunham, L etaL 


009&I0 


Ounham 1 of al 


OOulO 


Oi inliofM 1 *^ al 

uunnam, 1. wjbl 


339230 


uunnam, u etat 


QOQ0C4 


nunham 1 at al 




nimham 1 of al 


OOSZDO 


uunnam, i. etai. 


OOttcOO 


Dunham, L etaL 


ooaBW 


uunnam, l bul 


OOSOW 


Dunham, L etaL 




Dunham, L etaL 


339344 


ninham 1 of ol 

uunnam, l aiin. 




Dunham, L etaL 


009009 


uunnam, u wat 


ooswi 


uunnam, i. buxl 


0QQA9A 
009w* 


uunnam, l blol 




ni wihaww 1 t»4 al 

uunnam, u eiaL 


00/ ow 




00/009 




337895 




337898 




0£90l 1 


OOQaTtf O 




0900910 






32532) 


5866870 


325338 


5866883 


325271 


58669M 

OOOU9U 1 


325285 




325289 
325300 


58S6903 
5866908 


325304 


5866910 


329630 


6729060 


325363 


5866920 


325359 
325408 


5866920 
5866921 


325409 


5866921 


325410 
325393 


5866921 
5866921 


325395 
325433 
325448 


5866921 
5866936 
5666941 



Minus 


169a£19-169f£247 


Minus 


17241596-17241507 


Minus 


18272)87-18272025 


Minus 


18534404-18534234 


Minus 


19747602-19747486 


Minus 


20172384-20172200 


Minus 


20174288-20174193 


Minus 


21063152-21083049 


mil Hi? 


213flft161-21 307923 

A KND 1 W ITS iMif abtM 


minus 


aCZUf DO 19-VtaCO/UwOO 




ataSKWat.1 lTtC999 lOaC 


murk. 


99QQ9T1 A-99QQ9irt1 
tCmf.t ilrCCfSCM 1 


muHA 


*MfWfH^^ -OTfVJCnOQ 
aWww l*rr-r<wlf mUS9 


Mfawte 

nwius 


24500806-24500442 


HAnira 
IWulua 


ZWOOWO^'WOOZ'K) 


Minus 


«99V4 l-wnCSOUWl / 


Minus 




Minus 


at/ f OaAcOnc/ / 09100 


minus 


*!/ 0990aC lit/ Q93U*K7 


minus 


49UJQ l*ff nCSUdVVW 


MHU9 


£9faO/-Kr|nclKX}rZ99 


IW1U9 


OVU I990/ -sXIU ISOOr 


Mbms 


0W9OX/0-0UH90I10 


Nanus 


OUOIOWf -OUOIOaC/O 


Minus 


OLPOZIiU&^XakUOJU 


Minus 


OUDO^lW-OUoOOiwO 


Minus 


OU/ tlOOO-OU/XI #*w 


Minus 


0U009/ Utc-OUOOOOl 1 


M%HJ3 


J lOOOatOaC-O iOOOUf Z 


m%1US 


tlTlQOiM H7AT7R7 


MDIUS 


AUZ 109/ -sJicOZl Of 1 


Minus 


JyfiMf 99*t"0a»Tf £SU - 


Minus 




Minus 


PtZ*i^aZ*+0 1U3/ 


Minus 


0O>l OrTT-l A_0QAQ1 <9T7 
0c49i / l*T-0at**9l00/ 


Minus - 


OjOAQfHSQOJaftilOfiAA/. 
0Z^jw9911KK*Kj0*Wv 


MWIUS 




UJbmib 

mhiUS 


jZDl9lO<f>}ZOl9\Ja 1 


Mrmm 

Minus 


32729004-32728929 


Minus 


o^Kr4oi4-o7094^lD 


Minus 


32926055*3292^^ ft/ 


Minus 


oZy/ol 4osS»r 0U41 


MVIUS 


ooojMTnrL'WQQyi cnn 
OlfixWUrOjfiKMOUU 


Minus 


33114230-33114010 


Minus 


33485220-33485144 


Mmus 


OaTtiHOf f -OO'KHOHl 


Mmus 


33503675-33503520 


Minus 


33589964-33559665 


MmtM 

Minus 


00041 D1O-O0041 *WO 


MfmM 

minus 


0W*K)09O^0Hl«0T0O 


Minus 


34332464-34392311 


MintiQ 


0490090/ -0*KW00#0 


mutUa 


1 If OU-I lOf 3 


Plus 


lOu/lr IOOOO 


mqIU9 


1 f £3Q- 1 f OO/ 


MIraia 

tVaVtUO 




mil IU9 


R97aC7^i9Q9ft 

WJHt *<JUC3£0 


rftO 


Of u lO-O/ Of o 


Plus 


1 fT/779.1(T7flfii 

IUf f #ai-|UfOW^ 


Mfmis 


12971*^129877 


Minus 


794fUUK9 

/9*tlTVUU9 




Of •tOO^alf OOt 


Minus 


IUf OaUT*l Vf Oil 


Mmus 


1018736-1018830 


Plus 


392668-392781 


Plus 


599118489289 


Plus 


599438499855 


Pius 


615477-615580 


Minus 


350513%350691 


Plus 


476868-477200 


Minus 


48070^480826 


Mmus 


392834492982 
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325464 


5S56947 


325480 


5866957 


325432 




32510 


588374 


32524 


5866381 


325513 


6017035 


3&519 


6017038 


325571 


655243Q 


329638 


3779004 


225523 


5867000 


325636 


5867002 


325677 


58S7D17 

*MH#f U If 


325691 


5867021 


325690 


586/0/1 


32702 


5867028 


325703 


5857028 


325/90 


6381957 


325783 


(FKJUf Btf 


325695 


6552446 


325726 




32574? 


6552446 


325760 


OOw'rlS 


325801 
325803 


6552451 
6552451 


325649 


KWm 1 

6588011 


325710 


6682473 


325712 


6682473 


325755 


6682474 


32751 


6682474 


325763 


6682475 


<JUI 31 


OQOx't/O 




*8H 2*JQ7 






329760 






6065777 


329737 


6065779 


329735 


6065780 


329719 


finfi57R5 


3?9/7S 


6065785 


329705 


6065740 


323665 


627212 


329793 


6522661 


329824 


wJUiJO 


329839 
329853 


6672062 


329870 


6682295 
6706435 


329879 


6466518 


329902 


6634760 


325651 


5857067 


325886 


5867087 


325887 


5887087 


326005 


5867112 


325945 


5867138 

«MJOf l«JU 


32953 


5867140 


325955 


5WT7147 
300/ In/ 


32966 


5867147 

300/ 111 


325977 
325835 


6249602 




6552452 
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326253 
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328031 


5902482 


328053 


5902482 


328243 


6056292 


328271 


6552415 


328592 


5868227 


328570 


5868231 


3288)7 


5868233 


328620 


5868241 


328624 


5868246 


328791 


5868309 


328810 


5868327 


328820 
328835 


5868330 
5868339 


328282 


5868353 


328314 


5868371 


328328 


5868375 


328420 


5868411 


328428 


5868417 


328436 


5866417 






328462 


5868433 






328474 


5868446 


328484 


5868454 


328504 


5868471 


328506 


5868471 


328507 


5868473 


328544 


5868486 


328552 
328557 


5868489 
5868489 


328558 
328276 


5868489 
6004471 


328277 


6004471 


328662 


6004473 


328636 


6004473 


328803 


6004475 


328305 


6004476 


328569 


6004480 


328581 


6006033 


328582 


6006033 




60171131 


328770 


6017031 


328841 


638192) 


328851 


6381923 


328859 


6381928 


328860 


6381928 


328863 


6381929 


328868 


6381930 


328876 
328886 


6525286 


328888 


6588003 

WOOUUv 


328936 


5868500 


328938 


5868500 


328971 


6476806 


330338 


5457162 


330327 


5919194 


330319 


5932415 


328974 


5868520 


328981 
328989 


5868527 


33Q363 


5868535 
3126882 


330370 


6580495 


329041 


5868564 


329078 


5868597 


329097 


5868624 


329107 


5868626 


329114 


5868650 



Minus 


118396-118490 


Plus 


5098951246 


Mhus 


33127-33485 


Plus 


44102-44319 


Plus 


30307-30527 


Minus 




Minus 


477679478113 


Minus 


507572-508519 


Minus 


931937-932171 


Plus 


1176372-1177283 


Minus 


27098592710010 


Ptus 


1-243 


Plus 


39015-39098 


Minus 


252407-252565 


Plus 


8921049816 


Minus 


246798-246944 


Minus 


15651-15788 


Minus 


120668-120836 


Plus 


171592-171929 


Plus 


101730-101914 


Ptus 


90446-90602 


Plus 


8805348461 


Ptus 


72692-72819 


Minus 


288397-286505 


Plus 


169210-169407 


Plus 


5361243886 


Plus 


13599-13780 


Ptus 


203769203904 


Ptus 


6539346103 


Ptus 


4964949768 


Minus 


15954-16073 


Minus 


128777-128970 


Minus 


2197422140 


Plus 


47064-47217 


Plus 


6071640830 


Minus 


199637-199990 


Plus 


145659-145829 


Plus 


4732647607 


Plus 


138094-138161 


Plus 


14364M44108 


Plus 


13282-13450 


Minus 


279901-280181 


Plus 


1184773-1184855 


Plus 


192484-192543 


Minus 


291716-291948 


Minus 


3473044851 


Plus 


232898-233243 


Minus 


121249-121400 


Minus 


134177-134282 


Minus 


223741-224238 


Minus 


363933464166 


Minus 


52144479 


Plus 


29)22606 


Plus 


6904549138 


Pius 


8326543366 


Minus 


2931329506 


Plus 


112825-112993 


Plus 


9405344185 


Plus 


3106841429 


KBnus 


111901-111999 


Minus 


1352202-1352259 


Ptus 


1522923-1522986 


Atinus 


2397624105 


Plus 


4840W851B 


Plus 


121581-121683 


Ptus 


49095-50132 


Ptus 


3155741668 


Minus 


1(8677.105764 


Plus 


182088-182198 


Nflnus 


6183&41901 


Plus 


a none A 4 ceo 


Pius 


141592-141785 


Plus 


326798426860 


Phis 


12002-12170 


Plus 


101043-101190 


Minus 


2379223910 




218 



WO 02/068677 



PCT/US02/06001 



329116 


5888650 


329154 


58)8691 


329187 


S368M3 


329201 


5868718 


329221 


5668727 


329246 


58)8732 


323254 


5868733 


329326 
329330 
329382 


5868806 
5868806 
5868868 


329384 
329386 


58QHJ69 
6004484 


329140 


6O17Q60 
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TABLE 21: 

310 GENES DP-REGULATED IN COLON CANCER DERIVED LIVER 
METASTASES COMPARED TO NORMAL COLON TISSUE 

Table 21 shows 3 10 genes up-regulated in colon cancer derived liver metastases compared to 
normal colon tissue. These were selected from 59680 probesets on the ASymetrix/Eos Hu03 
GeneChip array such that the ratio of "average" colon cancer derived liver metastases to 
"average" normal colon tissues was greater than or equal to 3.0. The "average" colon cancer 
derived fiver metastases level was set to the 50th percentile. The "average" normal colon 
tissue level was set to the 50th percentile. 



Pfcey. Unique Eos pfubeset identifier mimbcr 

ExAccm Exemplar Accession number, Genbmk accession number 

Rl: Geoes up rocta vs nonna] 



Pkey ExAccn Unigene© Unigene Title Rl 

446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin. 26.72 

431958 X63629 Hs3877 cadherin 3, type 1, P^enn (placenta 1636 

409041 AB033025 Hs50081 WM1 199 protein 13.94 

444381 BE387335 Hs383713 ESTs, WeaHy similar to S64054 hypctheti 1190 

432314 AA533447 H&312989 ESTs 1234 

428330 L22524 Hs3256 matrix metalloprotelnasQ 7 {rratrflysfn, 11.60 

443162 T49951 Hs5Q29 DKFZP434GG32protem 952 

436385 BE551618 Hs.144097 ESTs 930 

418662 AI801098 Hs.151500 ESTs 9XJ0 

433312 AJ241331 Hs.131765 ESTs, Moderately similar to 138937 DNA/R 850 

412093 BE242691 Hs.14947 ESTs 8.74 

442369 AI565071 Hs.159983 ESTs 8.40 
426101 ALQ49987 Hs.166361 Homo salens mRNA; cDNA DKFZp564F112 (fr 839 

435937 AA830893 Hs.119769 ESTs 832 

452281 T93500 Hs38792 Homo sapiens cDNA RJ1 1041 fis, done PL 832 

432572 AI660840 Hs.191202 ESTs, WeaWy slmflar to ALUEJflJMAN Oil 756 

440524 R71264 Ha.16798 ESTs 7.94 

424878 H57111 Hs321132 ESTs 7.88 

430433 AM7B883 Hs3737G6 ESTs 752 

410245 C17908 Hs.194125 ESTs 7.78 

417315 AI080042 H&336901 ribcsomal prate* S24 7.76 

430665 BE350122 Hs.157367 ESTs, WeaWy slmflar to I78885 sennerlh 7.76 

432435 BE218886 Hs382070 ESTs 774 

426818 AA554827 Hs389115 DKFZp434A0131 protein 7.58 

419145 N99638 gteza39g11 Jl Soares fetal Over spleen 756 

444838 AV651680 HS308558 ESTs 754 

428046 AWB12795 Hs. 155381 ESTs, Moderately similar to 138022 hypot 7.48 

446682 AW205632 Hs311198 ESTs 736 
421221 AW276914 Hs526714 Homo sapiens done IMAG&713177, mRNA se 7.19 

440116 AI798851 Hs383108 hemoglobin, gamma G 7.12 

450230 AW016607 Hs301582 ESTs 7D8 
456332 AA228357 gfcnc39dQ5j1 NCI^CGAP_Pr2 Homo sapiens 7.04 

421814 L12350 Hs.108623 mrornbospondin 2 659 

440774 AI420611 Hs. 127832 ESTs 656 

428065 AI634046 Hs.157313 ESTs 6.78 

422330 D30783 Hs, 115263 epiregufin 6.72 

413950 AA249096 H&32793 ESTs 657 

438011 BE468173 Hs.145696 splicing factor (CC15) 6.62 

421057 T58283 Hs.10450 Homo sapiens cDNA:FU22063 fis, done H 658 

428698 AA852773 Hs534838 KIAA1866 protein . 6.40 

408806 AW847814 Hs389005 Homo sapiens cDNA: HJ21532 8s, cfcme C 658 

425787 AA363867 Hs.155029 ESTs 658 

435812 AA700439 Hs.188430 ESTs 652 
448974 AL049390 Hs32689 Homo sapiens mRNA; cDNA DKFZp58601318 (f 638 

418875 W19971 Hs333459 ESTs 632 

407284 A1539227 Hs314039 hypofteficaIpfotehFU23556 6.17 

408243 Y00787 Hs524 mteneutinS 6.12 

434936 A1285970 Hs.183817 ESTs 6.12 
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412088 AS89496 Hs.108932 ESTs 634 
450377 AB033091 Hs.74313 WAA1265 protein 630 
407618 AW054922 Ha53478 Homo sapiens cDNA FU12368 fis, done MA 538 
4082% AL1174S2 Hs.44155 DKFZP58SG1517 protein 534 
456999 AA3197S8 H&2965B1 eukaiyo&bansbtadD^afiontcictor 5i90 
432559 AW452948 Hs257631 ESTs " 538 

423349 ARM0258 Hs.127428 homeobaxA9 534 
436100 AA7O4806 Hs.143842 ESTs, WeaWy sfrnferto 20O43K3A chraraos 534 
453204 R10799 Ks. 191 390 ESTs 534 
429183 AB014604 Hs.197S55 WAA07D4 protein 5.78 
427882 AA640987 Hs.193767 ESTs 5.72 
447033 AQ57412 Hs.157601 ESTs 5.70 
428054 AB48688 Hs^6S519 ESTs 5.66 
414504 AW069181 Hs. 115175 stofle-alpha motif and leucine zipper c 5.64 
442806 AW294522 Hs.149991 ESTs 5.64 
418259 AA215404 Hs.137289 ESTs 5.60 
434963 AW974957 Hs288719 Homo sapiens cDNAFU12142fc, dona MA 530 
419899 AI760942 Hs.191754 ESTs 538 
431749 AL049263 Hs3CS292 Homo sapiens mRNA; cDNA DKFZp564F133 (fr 538 
422790 AA309875 Hs25933 ESTs 556 
440980 ALD42005 Hs.1117 tnpepfidyl peptidase D 5.48 
432451 AW972771 Hs292471 ESTs, WeaMy similar to ALUIJttJMAN ALU S 5.46 
438578 AA811244 Hs.164168 ESTs 5.44 
410467 AF102546 H&63931 dachsranid (Drosoph3a) hnmolog 542 
426317 AA312350 Hs.169294 transactor factor 7 (F«ell specffic, 5.42 
450164 A1233923 H&30098 ESTs 5.40 
438899 AF085833 Hs.135624 ESTs 538 
43345 AL043683 H&8173 hypofteftal protein FU10803 536 
437176 AW176909 Hs.42346 cataeurti^ntfrig protein cafeamn-1 534 
419829 AB24228 Hs. 115185 ESTs, Moderately sfcferto PC4259 feni 533 
4^7966 AA295Q52 H&38516 Homo sapiens, done MGW5887. mRNA, 00m 530 
447342 A1199268 Hs.19322 Homo sapiens, SimBartoRIKB^cDNA 2010 &26 
419682 H13139 H&92282 paie*flce homeocbmain transcripfionfa 526 
421097 AC80112 Hs.125232 Homo sapiens cDNAFU13266 fis. done OV 522 
443373 A1792868 Hs.135365 ESTs 522 
412059 AA317962 Hs249721 ESTs, Moderately sfmBarto PC4259 fern* 521 
443651 W22152 Hs282929 ESTs 521 
411274 NM_002776Hs39423 UromlO 5.17 
421999 U50535 Hs.110630 Hunan BRCA2 region, mRNA sequence CG006 5.17 
426981 AL044675 Hs.173081 K1AA0530 protein 5.14 
431319 AA07335O Hs302232 ESTs 5.10 
434966 AA557494 gtxntB6f04^1 NCLCGAP _Pr3 Homo sapiens 5.10 

418830 BE513731 Hs38959 hypoffleftal protein MGC4816 5D8 
428290 AI932995 Hs.183475 Homo sapiens done 25061 mRNA sequence 537 
408784 AW971350 Hs.63386 ESTs 504 
411975 AI916058 Hs.144583 ESTs 532 
409760 AA3Q2840 p^fSri0534Ad^sefesue,whtotHomo 437 

420717 AA284447 Hs271887 ESTs 436 
417035 AA192455 Hs22968 Homo sapiens done IMAGE:451939, mRNA se 435 
434442 AA737415 Hs.152826 ESTs 434 
441328 AJ98Z794 Hs.159473 ESTs 432 
438962 BE046594 gtxhn41c11_x1 NCLCGAP_RDF2 Homo sapiens 432 

451277 AKD01123 Hs26176 hypomefeal protein FU1Q261 432 
438403 BE273296 Hs254467 Homo sapiens CDNAFU13255 fis, done OV 430 
424950 AA602917 Hs.156974 ESTs 4J8 
436823 AW749865 Hs293645 ESTs, WeaWy similar to I38G22 hypofoefi 437 
444783 AK&M 468 Hs.62180 anffln (Drosophila Scraps homobg), ad 432 
444301 AKD00136 Hs.10760 asporin (LRR dass 1) 430 
445390 AE22165 Hs.144923 ESTs 430 
439608 AW864696 Hs301732 hypomefical protein M6C5306 478 
450503 NM_004460H&418 fibroblast acfiva&nprofe^a^ha 4.78 
432682 AJ3764C0 Hs.159588 ESTs 476 
426086 T94907 Hs.188572 ESTs 476 
435981 H74319 Hs.188620 ESTs 474 
432340 AA534222 gtenj21<H&s1 NCLCGAPJVV1 Homo saptens 4.72 

435756 AI418466 Hs33665 ESTs 472 
447982 H22953 Hs.137551 ESTs 472 
449509 AA001615 Hs34561 ESTs 472 
407946 AA226495 Hs.154292 ESTs 470 
426215 AW9S3419 Hs.155223 stanniocakin 2 4.70 
414783 AW069569 Hs278270 unacBve progesterone receptor, 23 KD 438 
417601 NM_014735Hs32292 K1AA0215 gene product 438 
438461 AW075485 Hs286049 p4wsphosentaam&^ 438 
449032 AA045573 Hs22900 nuctoradDr(efymroWemrod2>ft 438 
426501 AW043782 Hs293616 ESTs 437 
409024 AW883529 Hs.173830 ESTs, Weakly similar to AUJ7JiUMAN ALUS 437 
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439848 AW979249 gb£ST391359 MAGE resequences, MAGP Homo 456 

424762 AL119442 Hs.183684 eukar/otic translation Initiation (actor 456 
442007 AA30111B Hs.142838 nucleolar phosphcprotein Nopp34 452 
400632 W74Q01 Hs55279 serine (or cysteine) proteinase inhibfto 4.62 
432409 AA806S38 Hs.130732 KIAA1575 protein 4.60 
452220 BE158006 H&212298 ESTs 4.60 
442577 AA292&8 Hs.1 63900 ESTs 458 
434001 AW950905 Hs3697 seiine (or cysteine) proteinase Invito 458 
414271 AKD00275 Hs.75871 protein kinase C binding protein 1 458 
433854 AA610649 Hs333239 ESTs 456 
431315 AW972227 Ha163986 Homo sapiens cDNA: FU22765 fis, done K 453 
434220 AI174777 Hs783039 Homo sapiens PR02492 mRNA, compete cds 450 
467752 AI821270 Hs785643 Homo sapiens cDNA FU14364 fis, done HE 450 
449941 AW450536 HsT09260 ESTs 4.48 
415116 AA160363 Hs769956 ESTs 4.47 
414386 X00442 Hs.75990 haptoglobin 4.47 
422956 BE545072 Hs.122579 hypofteteal protein FU10461 4.44 
423974 AL118754 gbDKFZp761P1910_/l 761 (synonym: hamy2) 4.44 

449618 A1076459 Hs.15978 WAA1272 protein 4.44 
428279 AA425310 Hs.155768 ESTs, WeaWy simflar to A47582 B-cel gr 4.42 
430573 AA744550 Hs.1 36345 ESTs 4.42 
430929 AA489166 Hs. 156933 ESTs 4.40 
433530 BE349534 Hs781789 ESTs 4.40 
446039 T93096 Hs.17126 hypothetical protein MGC15912 4.40 
447082 T85314 Hs.42644 fttorcckMon^ke 439 
407168 R45175 Hs.117183 ESTs 438 
417067 AJ001417 Hs.81086 solute canterlamiy 22 (extraiieuronai 438 
408380 AF123050 Hs.44532 dlubkptin 436 
431379 AA504264 Hs.182937 pectidytproly] isomerase A ((^ctophiSn 436 
406671 AA129547 Hs.285754 met proto-oncogene (hepatocytB growth fa 434 
419317 AA236282 Hs. 172318 ESTs 432 
450295 AI766732 Ks71GS28 ESTs 432 
423578 AW960454 Hs722830 ESTs 431 
419553 N34145 Hs750814 ESTs, Moderately simaar to ZN91JiUMANZ 431 
429512 AA453987 Hs.144802 ESTs 430 
426848 H72531 Hs36190 ESTs 430 
429831 AA564489 Hs.137528 ESTs 430 
433735 AA608955 Hs.109653 ESTs 430 
450546 AA010200 Hs.175551 ESTs 477 
421059 A1654133 Hs30212 thyroid receptor interacting protein 15 477 
413243 AA769266 Hs.193657 ESTs 476 
433230 AW136134 Hs720277 ESTs 472 
439717 W94472 Hs.59529 ESTs. Moderately simflar to AUJ1_HUMAN A 470 
439362 AI954880 Hs.134604 ESTs 4.19 
450157 AW961576 Hs.60178 ESTs 4.17 
451690 AW451469 Hs709990 ESTs 4.17 
418681 NM.001949Hs.1189 E2Ftranscripflon factor 3 4.16 
443135 AI376331 Hs.1 56 1 03 ESTs 4.16 
443148 AKB4357 HsT1 1194 ESTs, WeaWysfrnflar to ALUSJtiJMAN ALU S 4.16 
407765 AW076Q27 Hs757711 ESTs, Moderately similar to ALIKLHUMAN A 4.14 
428825 AM84336 Hs.128783 ESTs. WeaHysim8ar to 138022 hypomefi 4.14 
447519 U46258 Hs339665 ESTs 4.14 
439451 AFD86270 Hs778554 hetenxtornafin-Ske protein 1 4.12 
450219 AI826999 Hs724624 ESTs 4.12 
431451 AA761378 Hs.1 92013 ESTs 4.11 
432917 NNL014125HS779812 PRO0327 proteh 4.10 
431328 AA502999 Hs791591 ESTs 4.09 
425992 AA367069 Hs.100636 ESTs 4.0S 
404571 4.06 
420911 U77413 Hs.100293 Ofinked N^cetyiglucosarnine (GlcNAc) tr 4.06 
421114 AW975051 Ks7S3156 ESTs. WeaHyslmfer to 178885 serine/fo 4X6 
432731 R31178 Hs787820 fibronectinl 4.06 
433588 AM56872 Hs.1 33386 ESTs 4X6 
434658 AK24436 Hs31Q286 ESTs 4.06 
444040 AF204231 Hs.182982 gcJgIn-67 4.08 
444984 H15474 Hs.132898 fatty acid desaturase 1 4X6 
438543 AA810141 Hs.192182 ESTs 4.05 
413497 BE177661 gb«C1-KTO9W)2030(W)1 1-W32 HT0598 Homo 4X4 

434575 A1133446 Hs799964 Homo sapiens clone FLB7723 PR02D55 mRNA, 4X4 
43)256 AA470152 Hs.192195 ESTs 4.04 
424839 AA740632 Hs.120850 ESTs, WeaWy sfrnSarto ALU1 JflJMAN ALU S 4X2 
429048 AI372949 Hs.44241 Homo sapiens cONA: FU21447 fis, done C 4.02 
449429 AA054224 Hs59847 ESTs 4X2 
410762 AF226053 Hs.66170 HSKM-B protein 4.00 
418876 AA740616 gbcny97f11^1 NCLCGAP.GCB1 Homo sapiens 4X0 

425905 AB032959 Hs318584 novel C3HC4 type Zinc finger (ring finge 4X0 
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429500 X78565 H&289114 taatahion (teia^ 400 
431393 AW971493 Hs.134269 ESTs* ^stm3ar to cytokine recepto 4.00 

435008 AF15Q262 Ns.162898 ESTs 4JD0 
431361 AW971375 Hs 292921 ESTs 357 
444816 Z48633 H&283742 Ksap^mmtordrntianspcstgi 196 
434701 AA460479 Hs321707 KIAA0742protefo 198 

413866 AW958264 Hs.103832 anto to yeastUpC, variant B 3J95 

424905 NMJQ2497HS.153704 NIMA (never ki mitosis genea)-reiated k 192 

42S479 Y0Q272 Hs.184572 ce9tiristoncyde2,G1 toSandG2to 3J91 

435714 AA633325 H&269860 ESTs 188 

447514 AI809314 Hsu208501 ESTs, WeaWys*naartoB34Q87hypoftefi 186 

453818 BE256832 Hs. 10711 hypoft^cal protein RJ13449 185 

433586 T853Q1 flteyd78dGS*1 Scans fey Bver spleen 185 

440638 AB76K1 gtete64e10jrf SoaiesJtfU_T_GBC_S1 Homo s 185 

417819 AQ53112 Hs.133540 ESTs 184 

4M55S BE244200 Hs£5075 KIAA0410ger» product 183 
423129 L44396 Hs. 1241 06 Homo sapiens cDNAFLH 1941 6s, clone HE 183 

453884 AA355925 HsJ6232 K1AA01 86 gene product 183 

431193 AW7495Q5 Hs296770 WAA1719 protein &81 

408262 AKD00631 Hs-52256 hypofiiefical protein FU2Q624 180 

425568 AW963118 Hs.161784 ESTs 178 
441085 AW136551 Hs.181245 Homo sapiens cDNARJI^ fis, done NT 177 

428079 AA421020 Hs 208919 ESTs 177 

412490 AWB03564 H&288850 Homo sapiens cDNA:FU22528fe l done H 176 

435354 AA678267 Hs.117115 ESTs 175 

436535 AW295687 Ha254420 ESTs 174 

420439 AW270041 Hs.193053 eukaryofc transbfion tnfe&n factor 172 

436090 AJ640635 Hs.116468 EST 171 

416265 AA177088 Hs.190065 ESTs 170 

417715 AW969587 H&86366 ESTs 167 

435677 AA694142 H&293726 ESTs, WeaHy storibrto TSGA RAT TESTIS 167 

4380)7 AW08Q237 H&252884 ESTs 166 

408194 AA601038 Hs.191797 ESTs, Weakly similar to S55657 aJpha-1C- 165 

417211 T97617 HsJ269092 ESTs 160 

435538 AB011540 Hs.4930 lowdensSyBpoprote^receptor-fBlated 159 

410390 AA876905 Hs.125286 ESTs 3-58 

438818 AW979008 H&222487 ESTs 157 

431416 AA532718 Hs.178604 ESTs 157 

433517 AW022133 Hs.189838 ESTs 156 

428355 BE256452 H&2257 varonecfin (serum spreading factor, som 156 

432954 A/076345 H&214199 ESTs 3,53 

434466 ABQ37829 Hs3862 regulator of nonsense tianscnpts 2; DKF 153 

421933 R98881 Hs.109655 sex comb mkfleg (Drosoph3a)4ke 1 152 

422082 AA016188 Hs.11 1244 hypoftefical protein 152 



437135 ALD38624 H&208752 ESTs, WeaWy similar to ALU8JUJMAN ALUS 149 

424723 BE409813 Hs. 152337 protein argfafne f^me(hyffransfef3se 3(h 149 

434280 BE005398 gb.^-BN0116-1504C0-1894iQ2BNO116Hcfl» 149 

407289 AA135159 Hi20334S Homo sapiens cDNA FU12149 fis, done MA 148 



417670 R07785 gteyf15c0Sj1 Soares fetal Iverspteen 148 

431615 AW295659 Hs235860 ESTs 148 

429355 AW973253 Ks_292689 ESTs 145 

430068 AA464964 gbaxSOflO^I Soaresovary tumor NbHOTH 145 

432929 AW207166 Ks.191265 ESTs 3.44 

437763 AA469369 H&5831 tissue Inhiitorof mefelJopraiemse 1 144 

445674 BE410347 Hs. 13053 transcription factor CA150 142 

408113 T82427 Hs.194101 Homo sapiens dm FU20869 fis, ctoneA 142 

408906 BE2962Z7 H&250822 sotoefflieonto Hnase 15 141 

432235 AA531129 Ha 190297 ESTs 3.41 

453985 N44545 Hs^51865 ESTs 3.41 

415736 AA827082 Hs^91872 ESTs 3.38 

430220 BE378277 Hs.152230 ESTs 137 

426510 AWB61225 Hs.194637 BANPhomcbg, SMAR1 homotog 137 

412104 AW2Q5197 Hs^40951 Homo sapiens, Sinflar to RiKEN cCNA 2210 3.36 

411573 ABQ29000 Ks.70823 WAA1077 protein 133 

413816 AW958181 Hs.189998 ESTs 332 

428057 AJ343641 Hs.185798 ESTs 3.32 

436280 AI690734 Hs.131740 Honx>saptensc^FU22562fe*doraH 331 

449365 AW968261 H3.11B913 ESTs, Moderately similar to T46371 hypot 131 

44C559 AF134160 Hs.7327 daudhl 130 

43110 AA704899 ttt291651 ESTs, WeaWysfrnfar to 13022 hypofitefi 129 

433862 D86980 H&3610 KWAO205 gene product 129 

424624 AB032947 Ks.151301 Ca2^fepenooitacfivator protein for seer 129 

439955 AW2Q3959 Ks.149532 ESTs 128 

417333 AL157545 Hs.42179 bnvnodamatnand PHD finger containing, 3 128 

43613) AW510927 Hs.125243 ESTs 127 

414900 AW452420 H&248678 ESTs 126 
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439349 AI660898 Hs.195602 ESTs 325 

42825 AE62747B Hs.187670 ESTs 324 

438217 T53925 Ha 107 arinogen-fte 1 324 

429083 Y09397 Hs2Z7817 B02-felatedpi^nA1 324 

422244 Y08890 Hs.113503 kaiyopherta{bnpoifin)beta3 322 

430178 AW449612 Hs. 152475 ESTs 321 

413810 AW197644 Hs.19107 ESTs 320 

428728 NMJM6625HS.191381 hypdreM protein 320 

437151 AA745618 Ha.194637 BANP homotog. SMAR1 homotog 3.19 
427051 BE178110 Hs.173374 Homo sapiens cDNARJ10500fe, done flT 3.19 

438378 AW970529 HsX6434 typoffiefical protefn RJ21B16 3.19 

439943 AW083789 Hs.124620 ESTs 3.18 

439280 A1125436 Ks.48752 ESTs 3.18 

452338 AA960961 Ha305953 zinc finger protein 83 (HPF1) 3.17 

433713 AW976511 Hs.1 12592 ESTs 3.16 

414998 NM.002543HS.77729 axkfisedbwdens^DpopfOteli{tectin 3.14 
407328 AA508857 Hs.187748 ESTs. WeaHy similar to ALUOttJMAN ALUS 3.14 

432722 AA830532 H&32B150 ESTs 3.14 



419457 AA243148 Hs209334 ESTs, Moderately similar to S23AJflJMANP 3.11 
449987 AW079749 Hs.184719 ESTs, WeaWy shn3ar to ALU1_HUMAN ALU S 3.11 
418522 AA605038 Hs.7149 Homo sapiens cDNA:FU21950te, (tone H 3.09 
409969 AVK14668 Hs.194258 ESTs, Moderate* sinter to ALU5_HUMAN A 3D8 



436299 AKD00767 H&5111 hypoftefical protefn RJ20729 3X8 
406687 M31126 Hs272620 pregnancy specffic b^a-1-grycoproteln 9 3.07 
408242 AA251594 Ks.43913 PBF1 gene product 3.07 
444614 R44284 Hs2730 heterogeneous nuctearftonudeoproteln 3.06 
459407 N92114 gteza22h11j1 Scares fetal Bverspteen 3X5 

433972 A1878910 H&3688 dspla^res^aiice-assodatedovefBxpr 3X4 
427704 AW971063 Ha 292682 ESTs 3.03 
440255 AI932285 Hs.160569 ESTs 3X3 
424542 A1860558 Hs2720Q9 ESTs, WeaJdysanDar to AU£_HUMAN ALUS 3.03 
413822 R08950 Hs272044 ESTs, WeaWy similar to ALU1_HUMAN AUU S 3.02 
433944 AL117518 H&3688 WAA0978 protein 3X1 



440428 BE560954 gtx601347719F1 NIHJ*GC_B Homo sapfens cD 3.00 



224 



WO 02/068677 



PCT/US02/06001 



— TABLE 21 A 

Table 21A shows the accession numbers for those pkeys lacking unigendD's for Table 21 A. 
For each piobeset we have listed the gene cluster number from which the oligonucleotides 
were designed Gene clusters were compiled using sequences derived from Genbank ESTs 
andmRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unique EcepntesetWenffiernuniber 

CAT number: Gene dusts' number 
Accession: Genbank accession numbed 



Pkey CAT Number Accession 

AA302840T93016T92950AA077551 
BE177661 HC6215 BE1447D9 BE144829 
R07785TB5948TB6972 
AA740616AA654854AA229923 

N99638 AW973750AA328271 H90994 AA558020AA234435N59S99R94815 
AL1 18754 AA3332Q2H38001 
AA464964M85405AA947566 
AA534222AA632632TB1234 
T85301 AW517087AA601054 BE073959 
BBQ05398 AA628622 AA994155 
AA657494A1582663AI581639 
BHM6584 BE04G667 AA8285B5 AI207343 
AW979249D63277AA846968 

A1376551TB7714AA897445 
AA228357 AWB41788 AW841716 



409760 115373 J 
413497 1373771J 
417670 1692153.1 
418876 179960J 
419145 182217J 
423974 233842J 
430068 312849J 
432340 345248.1 
433586 370470 J 
434280 382816 J 
434966 396504 1 
438982 467390 J 
439848 477806 J 
440428 49370.-1 
440638 499025.1 
456332 179104J 
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TABLE 21B 



Ptey: Unique number corresponding to an Eos prabeset 

Ret Sequence source. Trie 7 aTgft numbers tn this column are Genban* (darrtier (Gf) numbers. "Dunham L ei aL' refers to the pub&aEcn 

enffledTheDNA sequence of human chromosome 22." Dunham L et aU Nature (1999) 402:469495. 
Strand: Indicates DMA strand from which exerts were predicted. 

NLposffion: Indicates nudeotida positions of predicted exons. 



Ptey Ref Strand NLposffion 
404571 7249169 Minus 112450-112648 
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TABLE 22: 177 GENES DOWN-REGULATED IN COLON CANCER 
DERIVED LIVER METASTASES COMPARED TO NORMAL COLON 
TISSUE 

Table 22 shows 177 genes down-regulated in colon cancer derived liver metastases 
compared to normal colon tissue. These were selected from 59680 probesets on the 
Aflymetrix/Eos Hu03 GeneChip array such that the ratio of "average" colon cancer derived 
liver metastases to "average" normal colon tissues was less than or equal to 025. The 
"average" colon cancer derived liver metastases level was set to the 50th percentile. The 
"average" normal adult tissue level was set to the 50th percentile. 

Ptey: Unique Eos probesetMenfiBermmter 

ExAccn: Exemplar Accession number, Genbank accession number 

UnfceneJD: linger* number 

UnkjeneTffie: Unjgenegeneffie 

R1: Genes down mete vs. noma) 



t 

Pkey ExAccn UnlgeneJD UnJgeneTffla R1 

425196 AL037915 Hs.155097 (XiboxanhydraseU O03 

414522 AW518944 Hs.76325 step DspSdng factor SLU7 O03 

409153 W03754 H&50813 hypothefcai protein RJ20Q22 003 

452594 AU076405 Hs£9981 solute earner famfly 26 (sufetBtansp O03 - 

424325 NM_014479Hs.145296 (fislntegrin protease 0.04 

414798 AE86323 Hs.97411 hypomeficaJ protein MGC12335 0.04 

432150 AKD0Q224 Hs£72789 hypofoefeaJ protein FU2Q217 0.04 

4252Q8 NMJJ02153H3.155109 hydracystercH (17-bela) dehydrogenase 2 0.05 

437145 AF007216 H&5462 solute earner famfly 4, sodium bfcaibon 0.05 
447513 AW955776 Hs.313500 ESTs, Moderately similar to AUJ7_HUMAN A 0.05 

414807 AI738616 Hs.77348 tydraxyprosfaglandln dehydrogenase 15-(N 0.06 

428934 AF039401 Hs.194659 chloride channel, cafcium acfivated.fern 0.05 

432251 AW972983 Ha232165 polycytonia rubra vera 1; eel surface O07 

431727 AW293464 Hs.162031 ESTs 0.07 

421515 Y11339 Hs.105352 GatNAcaPa^&^raly&rarsfeTaseU 007 

414555 N98569 Hs.76422 phosphofpaseA2 f group IlA(pbteIets, 0.08 

412047 AA934589 Hs.49696 ESTs 0.08 

412056 728160 Hs778 guanyiala cydase activator 1 B (retinal 0.08 

422440 NlvL004812Hs.11b724 aidc^ reduce f^1, member B10 0.08 

450684 AA872605 Hs25333 IntBrteuHn 1 receptor, type U 0.09 

418935 T28499 H&A9485 rarbonfcanhydraselV 0.09 

433658 103678 Hs.1561 10 hinurogtobufci lappa constant O09 

422260 AA315993 4s. 105484 regeneraflng gene type IV 0.09 

4333% AF017986 Hs31386 secreted nizz!e(ke^ protein 2 0.09 

426784 U03749 Hs.172216 diionxgranln A (paratiyroid secretorypr 0.09 

441888 AI733306 Hs.128071 hypome&a)piotenFU21302 O10 

440624 AF017987 Hs.7306 secreted fczled-fBtated protein 1 0.10 

420929 AJ694143 Ha296251 programmed ceOdea&i 4 0.10 

429970 AK000072 Ks^27059 chloride channel calcium acfivatel, ram O10 

417233 W25005 Hs^4395 small inducible cytokine subfemUy B (Cy 0.10 

414802 AI793107 Hik27018 Rb 0.10 

424566 Ml 6801 Hs.1790 nuclear receptor subfamily 3, group C, m 011 

421996 AW583807 Hs.1460 glucagon 011 

423371 AU076819 He. 1550 sdute earner family 26, member 3 0.11 

406741 AA058357 Hs.74468 carciaoembryonlc antgen-refated cell ad 0.11 

414176 BE140638 Hs.75794 endomefe! differentiation, lysophospha 011 

408741 M73720 Hs£46 cait)Qxypep&iaseA3(mastcei} 011 

424677 AW138558 H&267158 ESTs, Weakly similar to I54374 gene NF2 012 

426682 AV66QG38 fe20K UDP pjycro y ttfansferase 1 family, polype 012 

453987 AW009077 H&232947 ESTs 012 

425920 AL04S977 Hs.162209 cfaudinB 013 

408134 AKD00184 Hs.42945 add sphiigomYerinase-&B phosprxxfeste 013 

457407 AA5Q5Q35 Hs.195651 ESTs 013 

446500 U78093 Ks.15154 sushkepeakantalntog protein, Xchrorn 0.14 

422487 AJ01D901 Hs.198267 mucM.tracrteobn^nchial 014 

409196 NM-001874H&334673 caibacypepfldase M 014 

416426 AA180256 H&210473 Homo sapiens cDNAFU14672 8s, done PL 014 

406636 L12064 gfcHcmo sapiens {done WR4.12VL) antHh 0.14 
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457982 AW856CQ3 Ks.183617 ESTs 0.14 

407744 AB020629 K&38095 ATP-Wndlng cassette, sub-fam9y A (ABC1 0.14 

430378 Z29572 Hs2556 tumor necrosis fac tor receptor superfeml 0.14 

424885 AQ33771 Hs.82204 ESTs 0.14 

423566 AW958201 Hs.178589 hepatocetecaidnomaan^engBn8S2 0.14 

444237 AA336878 Hjl9842 Hunan DMA sequence from done RP4-788L20 0.14 

445848 AA774824 Hs.13377 Homo sapiens done 23649 and 23755 unkno 0.14 

451082 AL110125 Hs25910 Homo sapiens mRNA; cDNA DKFZp564C1416 (f 0.14 



436485 X59135 Hs. 1561 10 kmumoglcbuin kappa constant 0.14 

423655 AA722425 Hs.182785 ESTs, Moderately slmiar to 1207283A rev 0.15 

417332 AW972717 Ha288462 hypoihefical protein FU21511 0.15 

427506 AKD00134 Hs.179100 hypothefical protein FU201Z7 0.15 

430712 AW044647 Hs.196284 ESTs 0.15 

421669 AL035250 Hs.1408 endothefin3 0.16 

42692 D90041 Hs. 155956 N^acBty&ransfefBS8 1 (aryiantoeN*cety ai6 

429412 Nty.006235H9.24O7 POU domain, dass 2. associating facto 0.16 

433745 AF075320 Ha28980 hypotnefical protein FU14540 0.16 



450085 AW293791 Hs.60162 Homo sapiens cONA:RJ21528fis. done C 0.16 
417820 D87449 Hs*2635 UDP-glucwonfc acMOJDP^iacety^afectDS 0.16 
406722 H27498 Hs293441 Homo sapiens SNC73 protein (SNC73) mRNA, 0.16 



428488 XQ3350 HS.4 alcohol o^hydrogenase 1B (ciass 0, bete 0.16 

438327 AA813075 Hs.120181 ESTs 0.16 

408873 AUM6017 Hs.182278 calmoourm2(pJxJsphoo^kira^ tfelt 0.16 

429524 AB033037 Hs205293 WAA1211 protein 0.16 

447023 AA356764 Hs.17109 integral membrane protein 2A 0.17 

424264 D80400 Hs239388 Human DNAseouence from dene RP1-304B14 0.17 

410310 J02931 Hs.62192 coagufeiSon kctof Hi (thromboplastin, 0.17 

432563 NM_013261rfe.198468 penxdsoms proteafive acfivated recep 0,17 

408897 M57417 grxHomo sapiens mucin (mucin) mRNA, part 0.17 

4510% BE383234 Hs25925 Homo sapiens, done MGC:15393, mRNA, com 0.17 

447726 AL137638 Hs.19368 matrilln2 0.17 

409549 ABQ29015 Hs54886 phosphoBpase C, epsfai 2 0.17 

433334 A1927208 Hs231958 matrix rnefeloproteJnase 28 0.17 

42849 AJ000512 Hs296323 serum/lgiuoocomcofai regulated toiase 0.17 
407360 X13075 gtxHuman 2a12 mRNA tor kappa^mmunogtobu 0.17 

430627 U61148 H&247685 atonal homoJog 1 (Dfosophfla) 0.17 

418807 NM.004944HS.88646 ctocynT)onucteaseHD©3 0.18 

453399 Z70295 Hs32966 guanytate cyclase acfivator 2B (uroguany 0.18 

422994 AW8916Q2 Hs296276 ESTs 0.18 

432134 A1816782 Hs.122583 hypome&al proteh RJ21934 0.1B 

400417 X72475 0.18 

443506 H10661 Hs.192124 ESTs, Weakly slmflar to 138022 hypotneS 0.18 

428470 ACC02301 Hs.184507 Homo sapiens Chromosome 16 BACctoneOT 0.18 

451928 A1823801 H&30315 Cmtmw antigen se57-1 0.18 

429576 BE242628 Ks209061 sudD (suppressor of bM6,AsperglIusn 0.18 

422106 D84239 Hs.1 11732 Fcnagment of tgG KnoTng protein 0.19 

430304 AL122071 H&238927 Homo sapiens mRNA; cDNA DKFZp434H1235 (f 0.19 

452852 AX001972 Ha30822 hypomeifca! protein FU1 11 10 0.19 

421904 BE143533 Hs.109309 rrypomefcai protein FU20035 0.19 

417165 R80137 Hs3Q2738 Homo sapiens cDNA: HJ21425 fis, done C 0.19 

417771 AA804698 Hs42547 retinofcadd receptor iesporater(tozaro 0.19 

452802 AU076403 Hs323468 electroirtnjnsfen1r#4i^^ 0.19 

450680 AF131784 Hs25318 Homo sapiens done 25194 mRNA sequence 0.19 

420061 AWQ24937 Hs29410 ESTs 0.19 

426828 NMJX)0020Hs,172670 activto A receptor type tHte 1 0.19 

409190 AB032963 H&43577 ATPase, Class I, type 88, member 2 0.19 

437682 AA476652 Hs34952 Homo sapiens cDNA: FUB3371 fis, done H 0.19 

449110 H56112 gteyq95107.r1 Scares fetal foer spleen 0.19 

446727 AB011095 Hs.16032 WAA0523 protein 0.19 

408395 BE072425 Ks.44579 hypomefeal proteh FU20199 020 

423541 AA296922 Hs.129778 Gastrointestinal pepfide 020 

410850 AW362867 Hs.302738 Homo sapiens cDNA; FLJ21425 fis. done C O20 

412420 ALD35668 Hs.73853 bona mofphogenette protein 2 020 

423942 AF2Q9704 Hs.135723 gtyooflpid transfer protein 020 

421832 NNL016098HS.108725 HSPC040 protein 020 

459046 AA910339 HsJ26216 LOC50627 020 

421360 AA297012 Hs.103839 erythrocyte rnembfane protein band 4.1-tl O20 

438091 AW373C62 Hs£3823 nuctear receptor siirfamiiy 1 , group I. m 020 

403047 020 

421712 AKD00140 Hs.107139 hypomefcal protein 020 

427333 AF067797 Hs.176658 aquaponnS 020 

421964 X73079 Hs288579 pdymerfc ImmunogtobuIIn receptor 020 

438089 W05391 Hs.83623 nuclear receptor subramBy 1, group I, m 021 

445200 AA084460 Hs.12409 somatostatin 021 

404854 021 

426390 AA377299 Ha90431 ESTs 021 
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403381 021 

449833 R82252 Hs.106106 protein kinase (cAMPniependent, catalyfi 021 
457718 F18572 Hs22978 ESTs, WeaMy sfcnDar to ALJLWJflJMAN ALU S 021 

43573) ABQ2D635 Hs.4984 KJAA0828 protein 021 

431518 AA743462 Hs.165337 ESTs 021 

4125® R28660 Hs^4305 ESTs 021 

432584 AA928829 Hs.47099 hypo&efcaJ protein FU21212 021 

426088 AF O38007 Hs.166196 ATPase, Ciass I, type 8B, member 1 021 

429143 AA333327 Hs. 197335 ptoragWanBtecaibaxypepfidase 021 

414429 R51494 Hs71818 ESTs 022 
439670 AF088076 Ha£9507 ESTs.WeaMysimBartoACC048583U1sm 022 

406697 M21388 Hs.123017 Human uiproducftrely reammged Ig niKh 022 

406663 U24683 tk302063 immunogbfaufin heavy constant mu 022 

407811 AW1S0302 Hs.40098 cystehe knot superfemSy 1, BMP antagmi 022 

417880 BE241595 Hs.82848 setec&n I flyrnpho^adhesknirnoteGub 022 

430107 AA465293 Ha.105069 ESTs 022 

424273 W40460 Hs.144442 phosptoGpaseA2, group X 022 

419559 Y07828 Hs31096 ring firmer p rotew 022 

413517 N76712 Hs.44829 ESTs, Weafysndarto 138022 typottefi 022 

407243 AA058357 Hs.74466 canira»nt»ycnican^Heiatedceflad 022 

433908 AI167816 Hs.43355 ESTs 022 

446203 Z47553 Hs.14286 flavin containing monoaxygenase 5 022 

403740 022 

405701 022 

413554 AA319146 Hs75426 seoEtogrann n (chroraogranln C) 022 

419577 L36531 K&91296 titegrh. alpha 8 0.23 

451820 AW058357 Ha337353 ESTs 023 

424897 D63216 Hs.1 53684 frizztekeJated protein 023 

422880 AF228704 Hs.121524 gkitafoione reductase 023 

430832 AHJ73913 rte.100S85 ESTs, WeaWysimSar to JE0350 Anterior 023 . 

430753 AJ432401 Hs2659 Bbrtnogen-t3ke2 023 

409060 A1815887 Hs£0130 necdln (mouse) homoiog 023 

412228 AW503785 Hs.73792 comptef7£fitccfnporient(3d/Epst^ 024 

414171 AA36D328 Hs465 RAP1A, member of RAS oncogene family 024 

417916 NM.006416HS.82921 sohite carrier famly 35 (C&ff>«iafcad 024 

414589 AA149791 Hs£8864 ESTs, Weakly similar to phcsphatxfyfseri 024 

427167 AI239607 Hs39196 hypothefeal protein MGC11324 024 
440630 BE561430 Hs^39388 Human DNA sequence fiom clone RP1-304B14 024 

423044 AA320829 Hs.97266 protocadttrtn 18 024 

441931 BE564830 Hs23744 hypothefcal proteki FU12899 024 

443060 D78874 HsJ944 procoSagen Oendopepfidase enhancer 2 024 

405441 024 

407241 M34516 gfcHuman omega flght chain protein 14.1 024 

415165 AW887604 Hs.78065 complement con^onent 7 024 

426447 AV655843 Hs.169919 eJedran^rar^-fevcpfotein. alpha po 024 

410748 BE3 83816 Hs.12532 cttrofnosome 1 open reading frame 21 024 

436032 AA150797 Hs.109276 (alexin protein 024 

414256 AW410035 Ha.75862 MAD (mothers gainst o^caperrtapJ8gjc,Dr 024 

414197 W44877 H&55501 ESTs 024 

406836 AW514501 Ks.156110 immurogbbtiin kappa constant 024 

437083 AW082597 Hs244862 ESTs 025 

421709 AA159394 Hs.107056 CH)-6protem 025 

426512 AW511656 Hs.170177 Meisi (mouse) homoiog 025 
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TABLE 22A 

Table 22A shows the accession numbers for those pkeys lacking unigenelD's for Tables 
21 A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
Genbank ESTs and mKNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 

Pkey: Unique Eos probeset Identifier number 

CAT number. Gene duster number 

Accession: Genbank accession numbers . . 

Pkey CAT Number Accession 

449110 738430 J H561 12 H58047 AJ630710 N5Q742 
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TABLE 22B 



Ptey: Unique number oonespondlng to an Eos protest 

Ret Sapience source. Tte7dWnumbei5lntec(>lunnia^ -Dun^LetaL*r5{astD8t8putiIcaSon 

en^^DNAseo^eiu»ctfhHianc^^ Dunham LetaUNalure (1999) 402:48*495. 

Stramt tooicatesDrM stand from whto 

NLposfflon: Indicates raideo8te|M^ 



Ptey Ref Strand NLpostbon 

403047 3540153 Mnus 59793-53968 

403381 9438267 f*ius 2600936178 

403740 7630882 Phis 85504-87227 

404854 7143420 Plus 14260-14537 

405441 7408124 Plus 100952-101283 

405701 4263751 Plus 9324*43364 
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TABLE 23: 175 GENES UP-REGULATED IN COLON CANCER 
DERIVED LIVER METASTASES COMPARED TO COLON CANCER 
PRIMARY TUMOR SAMPLES CLASSIFIED AS DUKE'S B SURVIVOR 

Table 23 shows 175 genes up-regulated in colon cancer derived liver metastases compared to 
colon cancer primary tumor samples classified as Duke's B stage with a positive survival 
outcome (Duke's B survivor). These were selected from 59680 probesets on the 
Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" colon cancer derived 
liver metastases to "average" Duke's B survivor was greater than or equal to 3.0. The 
"average" colon cancer derived liver metastases level was set to the 50th percentile. The 
"average" Duke's B survivor level was set to the 50th percentile. 



Pkey: Unique Eos probesei Identifier number 
ExAccrc Exemplar Accession number, Genbank accession number 



UnlgenelD: Unigene number 
Unpens Tffie: Ungate gene fife 

R1: GenesuptomefastesesvsDukB^Bstiivivors 



Pkey EzAccn Urigenefl) Unigene Tffle R1 

426101 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKF2p564F1 12 (fr 9.06 

432572 AI660840 Hs.191202 ESTs, Weakly simaarto ALUEJflJMAN UK 7.96 

424878 H57111 H&221132 ESTs 7.88 

428046 AW81Z795 Hs.155381 ESTs, Moderately similar to 138022 hypot 7.48 

407284 AI539227 Hs314039 hypofoe&al protein FU23556 7.45 

439943 AVW83789 Hs.124820 ESTs 7.00 

442369 AI565071 Hs.159983 ESTs 7.00 

415116 AA160363 H&269956 ESTs 6.93 

433517 AW022133 Hs.189338 ESTs 6.70 

437176 AW176909 Hs.42346 calrjmeurWxndfng protein cafeardn-1 6.68 

440524 R71264 Hs.16798 ESTs 6.62 

408806 AWB47814 Hs.289005 Homo sapiens cONA: FU21532 fe, done C 658 
448974 AL049390 Ks22689 Homo sapiens raRNA; cDNA DKFZp58601318 (f &28 

412088 AI6894S8 Hs.108932 ESTs 6.04 

417670 R07785 gfcyf15c06j1 Soaies fetal Bverspteen 535 

440774 Ai420611 Hs.127832 ESTs 551 

426086 T94907 Hs.188572 ESTs 5J0 

436100 AA704806 Hs.143842 ESTs, Weakly similar to 2004399A chromos 554 

453204 R10799 Hs.191990 ESTs 554 

407289 AA135159 Hs3Q3349 Homo sapiens cDNA RJ12149 fis, ctone MA 557 

432435 BE218886 H&282070 ESTs 551 

434963 AW974957 H&288719 Homo sapiens cDNA FU12142 6s, clone MA 5.60 

421221 AW276914 Hs526714 Homo sapiens done IMAGE713177, mRNA se 554 

407328 AA508857 Hs.187748 ESTs. Weakly simBar to ALUIJWMAN ALUS 551 

440980 ALD42D05 Hs.11 17 tripepfldyl peptidase II 5.48 

443651 W22152 Hs 282929 ESTs 5.42 

412668 AA456135 Hs.10056 hypofiiefical prateb FU14621 5^9 

444838 AV651680 H&208558 ESTs 5.24 

433312 AI241331 Hs.131765 ESTs, Moderately similar to 138937 DNA/R 5.11 

430665 BE350122 Hs. 157367 ESTs, WeaHy sknter to I78885 serine/m 5.11 

434966 AA657494 gbjrf56f04.s1 NCLCGAPJtt Homo sapiens 5.10 

426897 AW976570 Hs57387 ESTs 5.05 

432954 AI076345 HsJM4199 ESTs 5.07 

431416 AA532718 Hs.178604 ESTs 5.00 

420717 AA284447 H&271887 ESTs 4.96 

424950 AA602917 Hs.156974 ESTs 454 
438962 BBM6594 gMm41c11j(1 NCLCGAP_RDF2 Homo sapiens4.92 

419999 AI760942 Hs.191754 ESTs 4.89 

435812 AA700439 Hs.188490 ESTs 4.86 

418662 AI801098 Hs.151500 ESTs 4.79 

428065 AI634046 Hs.157313 ESTs 4.77 

407618 AW054922 HS53478 Homo sapiens cONA FU12366 fis, dona MA 4.75 

435981 H74319 Hs.188620 ESTs 4.74 

419145 N99638 gb2a39g1 1/1 Scares fetal Ever spleen 4.73 

432340 AA534222 gfcn£1d02jj1 NCLCGAP_AA1 Homo sapiens 4.72 

447982 H22953 Hs.137551 ESTs 4.72 
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449508 AA001615 Hs34561 ESTs 472 
407946 AA226495 Hs.154292 ESTs 4 JO 

43607 AW08QZ37 Hs2S834 ESTs 4-68 
438406 BE273296 hte2544S7 Homo salens cDNAFU1325fis,doneOV 432 
426818 AA5548Z7 Hs289115 DKFZp434A0131 protein 432 
452220 BE158Q06 Hs212296 ESTs 4.60 
436823 AW749865 Hs33845 ESTs, WeaHyshiflar to 138022 hypofosfi 430 
433854 AA610649 Hs333239 ESTs 456 
413816 AW958181 Hs.189998 ESTs 432 
428079 AA42102D Hs208919 ESTs 452 
421097 AI280112 H3.12B32 Homo s ap iens cONAFU13266fis t done OV 450 
417035 AA192455 HsJ22S58 rtamo sapiens done O^GE-451 939, mRNAse 4.48 
423974 AL11S754 gfcDKFZp761P1910_r1 761 {synonym: hamy?) 4.44 

449618 AI076459 Hs.15978 WM1272pnMi 4.44 
431615 AW29S859 Hs235860 ESTs 444 
418876 AA740616 gtaiy97fl1.s1 NCLCGAP GC81 Homo sapiens 4.43 

428279 AA425310 Hs.155766 ESTs, Weakly sMter to A47582 B-cel gr 4.42 
430573 AA744550 Hs.136345 ESTs 442 
430929 AA489166 Hs.156933 ESTs 4.40 
446099 T93096 Hs.17126 hypofefcafl prelum M6C15912 4.40 
439362 AI954880 fe.134604 ESTs 436 
421999 U50535 Hs.110630 Hmm BRCA2 region, mRNA sequence 435 
434220 A1174777 Hs283039 Homo sapiens PR02492 mRNA, complete cds 4.33 
432925 AAS78324 Hs.192734 ESTs 432 
417819 A1253112 Hs.133540 ESTs 430 
426848 H72531 Hs36190 ESTs 430 
429831 AA564489 Hs.137526 ESTs 430 
433735 AA6Q8955 Hs.109653 ESTs 430 
418884 AA230228 H&59197 ESTs 4.28 
413243 AA769266 Hs.193657 ESTs 426 
431749 AL049263 Hs.306232 Homo sapiens mRNA; cDNA DKFZp564F133 (fir 423 
428054 AI946688 Hs266619 ESTs 422 
413957 AW204431 Hs. 117853 ESTs, WeaWy simfer to 138Q22 hypotheti 422 
433230 AW136134 Hs22Q277 ESTs 422 
421057 T58283 Ha.10450 Homo sapiens cONA: FU22063 (is, done H 422 
423578 AW960454 Hs222830 ESTs 421 

439717 W94472 Hs39529 ESTs, Moderately similar to ALU1 JftJMAN A 420 

443696 AW607444 Hs.134622 ESTs 420 

432722 AA830532 Hs326150 ESTs 4.18 

435756 AI418466 Hs33G65 ESTs 414 

428825 A1084336 Hs. 123783 ESTs, WeaJdysin3ar to C8Q22hypo9»efi 4.14 

439451 ARJ86270 Hs278554 heterochroirafiivte pro tehl 4.12 

445943 AW898533 Hs.181574 ESTs 4.12 

450219 AI826999 Hs224624 ESTs 4.12 

431379 AA504264 Hs.182937 peptkiytpmJyt isomers 4.11 

432451 AW972771 H&292471 ESTs, Weakly similar to ALU1_HUMAN ALU S 4.10 

443148 A1034357 Hs211194 ESTs, Wealdyslmfiar to ALU8JiUMAN ALUS 4X8 

450177 AI698091 Hs.107845 ESTs 4X8 

420911 U77413 Hs.100293 CMnked f^acetyigliicosamha (GfcNAc) tr 4XB 

421114 AW975051 Hs293156 ESTs, Weddy similar to 178885 serinefth 4.06 

432731 R31178 Hs287820 Cbronecfin 1 4X6 

433588 AKJ56872 Hs.133386 ESTs 4.06 

434658 AJ624436 Hs310286 ESTs 4X6 

444040 AF204231 Hs.182982 go#rv67 4X6 

429512 AA453987 Hs. 144802 ESTs 4X6 

443349 A1052572 Hs269864 ESTs, Weaidy sfmfar to ALU1_HUMAN ALU S 4X4 

439867 AA847510 Hs.161292 ESTs 4X4 

425955 T96509 Hs248549 ESTs, Moderate^ shtiiar to S65657 alpha 4.02 

431393 AW971493 Hs.134269 ESTs, Hkjhfy simflar to cytokine recepto 4X0 

432125 AW972667 Hs287510 Homo sapiens cDNA FU12300 fe, done MA 4X0 

4354® AW362803 Hs.166271 ESTs aX7 

412(39 AA317962 Hs249721 ESTs, Moderatefy sim2ar to PC4259 ferri 3X5 

446682 AW205632 Hs211198 ESTs 3X5 

441328 AI982794 Hs.159473 ESTs 3X2 
455778 BE088746 gb.-CM2^T0693-21 030O-1ZWD9 BTC693 Hocra 3.90 

438996 AW748336 Hs.168052 K1AA0421 protein 336 

418303 AA215791 Hs, 188541 ESTs, WeaWy simBar to 138022 hypoOtefl 335 

444816 Z48633 Ks283742 Ksapiens mRNA for letrofransposon 334 

429355 AW973253 Hs^2889 ESTs 333 

438578 AA811244 Hs.164168 ESTs 333 

432945 AL043883 Hs3173 hypofteficai protebi FU10803 333 

435318 T97301 Hs.18026 ESTs 332 

449941 AW450538 Ks2092G0 ESTs 330 

424915 R42755 Hs23Q96 ESTs 3.76 

449987 AW079749 Hs.184719 ESTs, Weaidy sftnter to ALUIJflJMAN ALUS 3.76 

416265 AA177088 Hs.190065 ESTs 3.75 
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413497 BE177681 gb«C1-HrOS8^2030(M)11-hQ2 KT0593 Homo 3.74 



412093 BE242691 Hs.14947 ESTs 3.74 

413822 R08950 H&272044 ESTs, WeaHy similar to ALUIJiUMAN ALUS 3.73 

431915 AKD00777 H&272197 Homo sapterscDNAFU20770fe, done CO 3.68 

434442 AA737415 Hs.1 52828 ESTs 3l63 

434959 AW974949 Hs. 186564 ESTs, Weakly similar to 138022 hypothefi 353 

427704 AW971063 Hs.292882 ESTs 3.62 

426510 AWB61225 Hs, 194637 BANP homolog, SMAR1 homotog 350 

435714 AA699325 H&269880 ESTs 3.60 

432598 AJ341227 Ks.157106 ESTs 357 

438543 AAB10141 Hs.192182 ESTs 355 

422088 A1807519 Hs.104520 Homo saptenscDNAFLJ13694fis, dons PL 354 

418259 AA215404 Hs.137289 ESTs 354 

428290 AI932995 Hs. 183475 Homo sapiens done 25061 mRNA sequence 3.49 

419457 AA243145 Hs209334 ESTs, Moderately similar to S23AJHUMAN P 3.47 

439312 AA8339Q2 Hs270745 ESTs 3.47 

408784 AW9713S0 Hs53386 ESTs 3.45 

456332 AA228357 gfcnC39d05j1 NC|_CGAPJ>r2 Homo saptens 3.45 

424762 AL119442 Hs.183684 eukaryotfc transfeflon HrJafttt factor 3.44 

442884 AKJ76570 Hs.134053 ESTs 3.44 

421023 AW449855 H&96557 Homo sapiens cDNA RJ12727 fis, ck>ne NT 3.43 
434575 AJ133446 Hs^99954 Homo saptens cftx le FLB7723 PRO2055 mRNA, 3.42 

430433 AA478883 H&273766 ESTs 339 

419317 AA236282 Hs.172318 ESTs 338 

448710 T62926 Hs304184 ESTs 337 

439322 H72245 Hs.188635 ESTs 337 

430332 R51790 H&239483 Human done 23933 mRNA sequence 335 

411755 BE327Q36 Hs.1 17494 ESTs 333 

427882 AA640987 Hs.1 93767 ESTs 3.28 

438899 AFD85833 Hs.135824 ESTs 328 

436535 AW295687 Hs354420 ESTs 3J25 

434936 AI285970 Hs.183817 ESTs 122 

451730 AF095687 Hs£6937 brain and nasopharyngeal carcinoma susce 3.18 

447514 AI809314 Hs.208501 ESTs. WeaJdy state to B34087hypoftefi 3.18 
413672 BE156536 gb.-QVO+fTO36W1010(W»1-h10 HT0368 Homo 3.16 

435073 AA664078 flbac04a05.s1 Stratagene tung (937210) H 3.13 

450295 A/766732 H&210628 ESTs 3.13 

419341 N71463 Hs.1 18888 ESTaWeaMy slmflar to ALU1JHUMANALUS 3.13 

434495 AW352170 Hs, 129086 Homo sapiens cDNA FU12007 fis. done HE 3.12 

408113 T82427 Hs.194101 Homo sapffins cDNA: FUXP&Q9 fis, done A 3.12 

456437 AI924228 Hs.115185 ESTs, Moderately similar to PC4259 ferri 3.12 

421489 AB22B21 Hs32433 ESTs 3.12 

436090 AI640635 Hs.1 16468 EST 3.11 

450230 AW016607 H&201582 ESTs 3.11 

438011 BE466173 Hs.145696 spScmg factor (CC1 3) 3.09 

418720 AJ381687 Hs39526 ESTs 359 

433102 AJ343966 Hs.1 58528 ESTs 3.08 

436150 AW510927 Hs.1 25243 ESTs 3.05 

440116 AI798851 Ha283108 hemogbbfo, gamma G 3.04 

414900 AW452420 Ha248678 ESTs 3JM 

435937 AA830893 Hs.1 19769 ESTs 3.02 

424848 AI263231 Hs327090 EST 352 

435354 AA678267 Hs.1 171 15 ESTs 3D0 
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TABLE 23 A 

Table 23A show the accession numbers for those pkeys lacking unigenelD's for tables 1- 
20A, 21 A, 22A, and 23A. For each probeset we have listed the gene cluster number from 
which the oligonucleotides were designed. Gene clusters were compiled using sequences 
derived from Genbank ESTs and mRNAs. These sequences were clustered based on 
sequence similarity using Clustering and Alignment Tools (DoubleTwist, Oakland 
California). The Genbank accession numbers for sequences comprising each cluster are 
listed in the "Accession" column. 



Ptey: Unique Eos probeset ktenfifiernuihber 

CAT number Gene duster number 
Accession: Genbank accessor! numbers 



Ptey CAT Number Accession 



413497 
413672 
417670 
418876 
419145 
423974 
432340 
434966 
435073 



4S778 



1373771,1 BE177651 K06215 BE144709 BE144829 

1382512J BE156536 BE156439 BE1567C0 BE156449 BE156653 BE156533 BE156524 BE156670 BE156721 BE1567Z3 

1692163.1 R077851B5948T86972 

179960J AA740616AA654854AA229923 

182217 J ^9638AW973^0AA32827lH9(e94AA558020AA234435N595»ra4815 

233842J AL118754AA333202H38001 

345248.1 AA534222AA632632TB1234 

396504J AA657494 A1582663 AI581639 

399701.1 AA664O78AW383313AA8O5009 

467390J BB046594BEG46667AAB28585AI2Q7343 

1364506J BED88746 BE088802 BED88755 BE088876 BE088947 BE088881 BEC8S952 

179104.1 AA228357 AW841786 AW841716 
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TABLE 24: 34 GENES DOWN-REGULATED IN COLON CANCER 
DERIVED LIVER METASTASES COMPARED TO COLON CANCER 
PRIMARY TUMOR SAMPLES CLASSIFIED AS DUKE'S B SURVIVOR 



Table 24 shows 34 genes down-regulated in colon cancer derived liver metastases compared 
to colon cancer primary tumor samples classified as Duke's B stage with a positive survival 
outcome (Duke's B survivor). These were selected from 59680 probesets on the 
Aflymetrix/Eos Hu03 GeneChip array such that the ratio of "average" colon cancer derived 
liver metastases to "average" Duke's B survivor was greater than or equal to 0.25. The 
"average" colon cancer derived liver metastases level was set to the 50th percentile. The 
"average" Duke's B survivor level was set to the 50th percentile. 



Ptey: UniqiraEosprobesetldefiffiermmter 

ExAccrn Exemplar Accession number, Genbank accession number 

UnigenetD: UnJgene number 

UnigeneTffie: Uragene gene title 

R1: Genes down Over metastases vs Duke's B survivors 



Pkey ExAccn UnigenefD UnlgeneTHte 



R1 



414522 AW51B944 Hs.76325 step It spBcfrig factor SLU7 

416768 AA383733 Ha.1032 regenerating bteWenved 1 alpha (pane 

409153 W03754 H&50813 hypoftetel proteh RJ20022 

414555 N98569 Ha.76422 phosphdlp8seA2. group UA (platelets, 

416007 M13509 Hs-83169 matrix metafeproteinase 1 OntersSSal 

424326 NMJM4479HS.145296 (fehtegrin protease 

428934 AF039401 Hs.194659 <m^dmy^(^it^M^1am 

417233 W25005 Hs24395 smaO Inducibte cytokine subfamily B (Cy 

422260 AA315993 Hs.105484 regenerating gene type IV 

425196 AL037915 Hs.155097 carbonic anhydrase II 

433336 AF017986 HsJ1386 secreted fttefe4ff&ted protein 2 

450685 L15533 H&423 partcreeiffis-assotiated protein 

407611 AW190902 Hs.40098 cysteine knot superfamfly 1, BrVPantagon 

414798 At2B6323 Hs.97411 hypothetical protein MGC12335 

452852 AKD01972 H&30822 hypofoeflcal protein RJ11 110 

447513 AW955776 Ha313500 ESTs, Moderately similar to ALU7 _HUMANA 

423541 AA296922 Ha.129778 gasboMesfinal peptide 

425071 NM.013989HS.154424 detodinase, todothyronine, type ll 

406636 L12064 gbitomo sapfens (done WR4.12VL) antkn 

421515 Y11339 Hs.105352 GalNAc alpha-2, 6«teiyfiransterase 1 1 

428368 BE44O042 H&83326 matrix metaltoprotelnase 3 {strometysln 

414812 X72755 Hs.77367 monokine Induced by gamma hterferon 

452594 AU076405 Hs29981 solute canter family 26 (sulfate transp 

428227 AA321649 Hs2248 smal inducfcJe cytokine subfemflyB(Cy 

408741 M73720 Hs.646 carboxypepOdase A3 (mast cell) 

453064 R40334 Hs39463 potassium targe conductance cafcainvacti 

431727 AW293464 Hs.162031 ESTs 

433658 L03678 Hs.156110 immunoglobulin kappa constant 

442064 A1422867 Hs.88594 ESTs 

417880 BE241595 Ks.82848 setecfin L (tymphocyte adhesion motecute 

430280 AA361258 Hs237B68 htedeukfn 7 receptor 

452877 At2SQ769 HsJ2478 ESTs 

410310 JQ2931 Hs.62192 coagulation factor HI (ftfomboptasfin, 

402408 



0.05 
0.07 
0.07 
0.11 

an 

0.11 
0.12 
0.12 
0.12 
0.13 
0.13 
0.14 
0.15 
0.16 
0.17 
0.17 
0.17 
0.18 
0.18 
0.18 
0.19 
0.20 
020 
021 
021 
021 
022 
022 
022 
022 
023 
023 
024 
024 
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TABLE 24B 



Ptey: UnkpianrntoGonespomfingtoaiEosprobeset 

Refc Sequence sourca The 7 djgfi numbera in tire cofarai are Genbank ktenSier (GI) number "Dunham Letai'refera to QtepubEcafion 

enftfed^im sequence erf toir^ Dunham L et aU Nafcire (1399) 402:489-495. 

Strand: Indicates DNA stand from which axons wercpredfeted 

NLposSoa* (ntoh^nudaj^posSons of predicted axons. 



Ptey Ref Strand NLposBion 
402408 9798239 ttnus 110326-110491 
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TABLE 25: 

Table 25 depicts Seq ID No., UnigeneK), UnigeneTitle, Pkey, and ExAccn for all of the 
sequences in Table 26. Seq ID No links the nucleic acid and protein sequence information in 
Table 26 to Table 25. 



Pkey: Unique Eos pmbeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

Unkjenefl): Un^ene number 

UmgeneTUte: UnJgena gene tife 

geojpjlo: Sequence kfentfflcafcn Number found In Table 26 



Pkey ExAccn Unigenefi) UnigeneTffie Seq ID Ho. 

42B101 AL049987 Hano sapiens mRNA;cDNADKFZ^564F112(fr U 

419145 N99638 gb 5&6 

426818 AA5548Z7 H&340046 DKFZp434A0131 protein 7&8 

421057 T58283 Homo sapiens cDNA 9 

446619 AU076643 Hs313 secreted phc^phopiotdn 1 (osteopontin, 10&11 

431958 X63629 H&2877 cadhenn3 l 1ype1,P-cac^(p^centa 12&13 

409041 AB033C25 HsJ0081 Hypothetical protein, XP_051860 (K1AA1 19 14&15 

443162 T49951 Hs£029 DKFZP434G032 protein 16&17 

436385 BE551618 Hs.144097 ESTs 1820 

447033 A1357412 Hs.157601 ESTs 21&22 

439608 AW864698 Hs301732 hypc*eO^protehMGC5306 2327 

449032 AA045573 Hs£2900 nudear factor (er/tJuokWerrved 2tfk 2S&29 

442577 AA292998 Hs.163900 ESTs 30&31 

429970 AKD00072 H&227059 chkinte channel cafciiima^ 32433 

424566 M16801 Hs.1790 nuclear receptor subfamily 3. group C, m 34&35 

457407 AA505035 HsJ45911 ESTs 36 

430378 Z29572 Hs2556 tumor necrosis factor receptor superfarni 37&38 

417332 AW972717 HsJ88462 rrypelr^cal protein RJ21 511 39&40 
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TABLE 25A 



Pfcsy: UiA|UBEDSprabesetldenffiernunibGf 
CAT number Gene duster number 

GenbanX accession numbers 



417332 165755 J 

419145 182217J 
421057 198849J 



Pkey CAT Number Accession 

409041 10962_2 AB033Q25 AL359061 AL045838 AT751 521 AI752304 AT752650 AAB53580 AI752290 AA853460 AI752769 AA852309 

AA853785 AA853219 AW0685Q3 AI7520S9 ALD49389 AW0S8368 BE439518 W52813 BE141833 AB40574 A1750606 AL109718 
AA242845 AA315795 AA307741 AW954603 AJ75207O AA350794 AI752649 AA307755 AW951677 AA298896 BE439632 
AA852453 AW06B826 AW853984 AA418236 AA639417 AW29091 7 AI750SJ2 AI752768 AUD45837 AI926513 AW262903 
BE439819 AI459360AW339074 AW295181 AWQ29483AT750945AI750659 AI752525 AI147688 BE440122 AI751522 AI473816 
AI752291 AK94639 A192581 6 AA599476 AA242752 AWQ21 892 AI755C98 AW469299 AW769363 AAB53579 AI784C82 
AAB52454 AJ925501 AA976657 AW150473 AW166734 

AW972717 AA5238Q5 AB62905 AB73245 AW235545 A1812045 AW589434 AJ826S24 AW572339 AJ377551 AA135718 
AI868470 

N93638 AW973750 AA32SZ71 K90994 AA558020 AA234435 N595S9 R94815 
T58283 AA765038 AA283052 K39398 AA814751 A1032674 N81016 N81017 BE222349 AA83D545 
M16801 NM.000901 D57171 AL041328 AFDS8623 A12D1 179 AA151766 AA568349 AJ698649 A)692765 BE327401 AA744953 
AA744951 AW361986 AV651840T29894 AW945146 AW945145 W24096AI183952AI458972 AW190993AI765359 AB34863 
AI741201 AW418944 AI7B7551 AA679687 AW772342 AW629508 BE5043CO AJ251790 A1522294 AA724341 AW615402 
AJ537570AA470665AM58375AW68901AA447079T23537^^ N91919AA149749 
428101 26088 J ALD49987 AW362842 T78981 AA247541 A1217018 AW961515 AA632986 AA663108 BE326465 AW872412 AIQ24S89 AA453725 
BE150456 AA229448 AA442633 AA442648 AJ916737 AA460220 AA868553 A1827987 AJ005467 R31132 AI742087 AA442379 
N56349 AW769479 AJ860142 A1917507 AA813604 AI860141 AJ459289 AA522837 AB54470 AI921333 BE466760 AW971 1 93 
AW103830 AW277065 AWQ2C895 A11 87977 K28268 AI084517 R95914 AA833517 AA563934 AA437299 AA436880 AA4477& 
AA812876 AA6631 78 R31089 AJ472712 R64648 AA60Q372 AA229164 AA7Q3Q66 AW270324 All 91725 AA55151 2 AA493776 
43818 272427J AA554827 AA701001 AW972954 AL039129 AA385540 AA91 1663 

429970 31134,1 AKD00072 AW840683 AW843764 AW844444 AW844515 AW603469 AW862395 AI86G838 AW511708 AF127035 NM.012128 
AK000138 

430378 3170J Z29572 AW978377 AA286871 AA633372 AA987627 AA743176 AI865358 AJ006884 AF031 845 Z14955 
431958 3394.1 X63829 NM.001793 BET75433 BE153414 BE153425 AW364593 BE315317 AW05O19O AA314252 BE142943 AW365220 
AW368405 BBD04269 AW366568 AL040609 AB29273 AI591168 BE146183 A1631 060 AI330793 W78081 W92295 AB27422 
BEQ09313 AB71793 AW993031 AI2D4659AA535113 AW993030 AM90281 AA555159 AW269637 AW993146 AI149268 
AA425217 AW473194 AI890930 AA551993 AB52106 W92308 AI827275 W45400 AI95232B AW609233 AA774611 AA551779 
AI913987 AI798658 AB37658 AW517535 AA632236 AW339148 AW589522 AA836945 AA961263 AWD15821 AW272946 
C00249 W40333BE143121 
436385 418907 J BB51618 At207338 BE220568 AI261568 AW841/37 AA714722 AA946891 AKJ33239 

439608 47438_3 AW864696 AW338889 AI342866 AA084522 AI244150 A1610339 AA425635 AA764930 AA976955 AW805766 AA057765 

AW805845 AWBQ2595 AA262971 AB69620 N75323 BE549060 AW805725 AA025809 N80776 N64595 AW073372 AA025493 
AJ819475 AW028879 AW189496 AA442907 AW41Q38J A191 1629 N71276 AW316922 AW805838 AA043880 AW189184 
AA44975S AA748153 AA705S38 AI910643 AA279492 BE1601 19 AW805761 AA026262 AA782207 AW057652 AW805768 
H21998 AW194254 AW275178 AA449040 AA279582 N76314 N54348 

442577 54549.1 AA292998 AW238350 AIS76059 AW074092 BE566458 AW07B677 AW5 1 4801 AW073701 AW 1 70620 AI523736 AK80870 
AI923975 AB93326 AI700229 AW450814 AW628452 A1571457 AA937534 AI889694 AW339423 AW291875 AA551874 
A1682314 AI92B227 AA397375 

443162 5613.1 T49951 AA025326 H04839 AA393303 R63101 W57657 W25S28 AB61431 R71 1 65 N39940 H01548 H01759 AA641624 

AI634930 AA595296 AW994770 AW994747 BE047247 W38159 AA858133 A17D1 944 A W386273 AA676625 R24676 R79410 
AA922863 AJ 15131 9 HOI 013 AAQ24482 WD2674 H01456 AJ150858 AW135972 AW631167 AI270332 H04750 T49622 AA004543 
R63061 AUJ93086 AI247539 H01225 H03388 AW472933 AA382448 AI21 9287 N27194 AW38S613 AA649738 AW994764 
AW389614 R25176 AA897262 R71626 AA9G9471 R71240 AW811917 R76109 A12Q2312 AI866010 R76162 AL117538 R79411 
T58656AW994674 

446619 685.1 AU076643 AA594604 AA346866 R18197 AA345192 AA337773 AA089791 R84435 AA337838 AW392167 AAD75190 D5541* 

AW150380 AW366257 AA579816 H93048 AW385689 AW385697 Al 1862 16 AW581197 AL037509 AB019562 AA232626 R97905 
AW368019 AA242891 AW8885Q2 AI798331 AW385635 AW5B1221 T96847 K87989 AA369511 AA075191 R8O742AA366406 
W92752 H45586 AJ864016 AW888497 BE004992 AI3841 10 AB24256 AI627593 W92728 AI6B2719 AA948208 AA171734 
N40517 JQ478> AA379957 AA362403 NMJKXJ582 ARK2124 AA300^0 AA333447 AA343721 AW889543BE566767 R76601 
R18015 AA100531 AA489963 AA101296 AA363513 AA344088 AA335750 T77505 D56440 AL110351 AL1 10331 F12195 R20175 
AA336684 H 17766 AA363538 AA363590 D28760 AW578517 AA363531 AI814667AA846899 AA366253AW951285AA297992 
AA327756 AW3616G9 AW815455 AW815427 AW815428 D54182 AW8522M) AA171630 W27018 AW815864 AW379995 
AW378222AW382610 BE566Q22 AWQ21023 C17352 D58435 AA345409 AI623991 AWQ20987 AI924770 AI799443 AW946393 
AA991239 AI571617 A1935181 AJ923999 AI826895 AIK0319 AW189873 AW270353 AW023584 A1813811 R99929 AW339056 
AA913152 AI636352 AI829394 AW151077 AW1 92580 A1570119 AI086391 AW021764 AW519154 A1375193 AW268078 
BE465©0 AW01^83 AW268654 AI573138 AI141809 A1954553 A1S9242 AA568945 AA886417 AW338527 AI635881 
BE465666 Ai9212^ AA988537 A19560Z7 AA91 1981 AI827661 AW51 1046 BE619780 A1922227 AB11870 AW190131 
AW129220 AW5129C5 A1290757 AB19IB8 Affi23771 AA775616 BE349419 AI1^375 H88773 AC41758 AW275157 AI337848 
AJ613425 A1631387 AAS22531 AI273483 AJS828S8 AW1K957 AJ446481 BE501588 BED48264 AI499922 AW023812 BE220523 
AW973846 BE349276 AM41091 AA9760Q) AW973845 AA101270 AB82472 AW513675 A11 3930) M2B3SZI AI276044 N22345 
A1261875 AA634136 A1S24468 AW887693 N27107 R21504 A1042223 N220S7 AW196871 A1581019 BBXM973 AA252035 
N22087 AA570717 H11250 A1804Q26 AA3O098 AA021512 H08842 N26275 AA176368 AI758758 AA570371 AA232574 
BE221177 AW190221 AW471386 M78225 AJ422140 AI524521 AA719775 AA3C0291 AA568657 AI871430 BE465630 N71862 
T72587 W92721 H83774 D54383 AW103693 AW089986 Ai382^ R42363 R44962 T98770 AA357374 AW022074 AI356207 
T29241 AW089431 AQ33875 USSXT N67352 AA121786 AA363910 P09824 T95618 N66888 RKJ550 A128C667 AW196719 
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R59299 AW021049 H73469 AI95431 1 BE439454 AW079450 AW973S50 AA348338 AW896006 AW268145 AA853631 H17650 
R39537 N66873 N67240 H05298 AI7&4199 R44260 AA9041 1B AA911756 F04544 AAB07809 AA665210 A1S9644S T29719 
AA837240 T64844 HQ8926 
447033 704603.1 AJ357412AJ87O708AI590539W07459 

449032 7945 1 AAQ45573 AA279920 R2D139 AA372783 AW963629 H21473 R7831B W743K) AAQ22505 AA369091 AWKM075 AA503638 

AV560815 AC162S2 AA779843 BE21 3825 AF125534 AW972129 AS19099 AI6212B3 AI300590 AS53701 AA33141 5 AW61 0546 
AW793050 AI953879 AW793047 AW510543 AE71 103 AW292105 AW0241 12 R77947 W76339 AA30511 1 AA132523 AA227467 
H21401 AW366572 AW024129 AI701885 AB54744 BB042803 AI347173 AW366Q53 AW662710 R36639 AJ4S9777 AA962733 
AI865366AA5O1993AW866054BE178974 

457407 333252J AA5G5Q35 AW235Q98 Ai634Q2B 
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Table 26 

SeqIDNDel DNAseqpence 

NndefcAol Accession ft sre Table 25 & 25 A for compter Est 



1 11 21 31 41 51 
I I I I I I 

CAATATAOTA CAATAACT AT TTGCATGACA TTTACATCGG ATATTATGAG TGATCTAGAO 60 
TTGATATGAA GTATATGGGA GGATGTGCAA AGGTGATGTG CAAATACTAT GTCATTTTAT 120 
AGGGGGGACT TGAGTATCCT TIQ1TAOOCT CAGQAGATOC TGAAAOCAGT CCCCCATGGA 180 
T ACTGAGGGC TGACTGT ATA GTOCTATOCT CACGGAACTT TCATTCT AAT GGGGGAAGAC 240 
TGACTATAAA CAAAATATAT GTAATAGGTG GTGGTAAGTA CCGTGGAGAA GTAACAAATG 300 
GGGCAAAGTG AGTT ATACA O CTOCATTCTT AGAAACCTTG GAQTACTTTT CTTAGTTTAT 360 
ACTCGTGGTG GTTTCCT 1 IT GTCIOCl 1 1 A TTACATGGOA CTCTGACATG TGOOCATAGC 420 
TAGGGTQACA GTAGGATCTA CCCGAT ACTA GGGTGGCAGT AGGATCTACC CAAAAAGCGT 480 
CCluciuATA CAG GACCA AA GCATCCTC3 J Ul 1 CTCGAGC CTATAAAAAG AGCTAATGGT 540 
GTTGCTTCIX:TTAACTGTGG CCrCCTACAC T O l U i 1 HUG ATGATTGGTG ATGTCI 1UGA 600 
TATTCTGTTT CTTTGGAACT TTGAATATAC AACACTTTAC TAGGGAATTA GCAATGGAAG 660 
CAGAGCAAAG ATGTACAGAG GAAA CAATGC GTAACTCTGA TGGAATTGAA GTCATGAGGC 720 
AGCAGAGAGC TTAAATTACA GCTTTAAAAA TTTTTATTTT TTAGAGGGAA TTTACTTGGG 780 
AGTAACAGCA GTAATAGTTA ACGGAGCCAG AATGCTTGAG TCATATAATT GCAAAGCAGA 840 
GTTGGGAGCA ACAGATGCTA AAGAGTAGTT GCTGTAGTTC CTCnTGGGT CGTAGG AGCA 900 
GTTGTCATAT TACTATATAG CTACTGCATG AAGAAGAGTT CTTAGTGAGG OCTGGGTGAA 960 
CAGCTCTTCT TAGTATTCTG TGTG ACCCCA TTTGACCTTT TAACAAATCC CTAAGTAAAT 1020 
AAATAGCCCC TCAGGAAAAC TAAGTTTTTC T CTGC T Gl UUI IbCl 1UA GAGAGCTATA 1080 
ACTG TAATAG ACTTATATTT CTGAACATTT TAGTGCTTGC CAATATTTGG TAATATTTAT 1140 
GTTTCCTATA TTTGTAATGA ACATTCTTCT TCCGGTACAT TTTTTGTTAA ATTATTGTTT 1200 
GATGGATAAA AGTTCACCTT TTATTGTATA AAATTGACTG AGATTAATTT ATACACATTG 1260 
ACAATGGGTA AATAGAATTT TTCAGATTAT TAAAAGCTGA AGGATGACCA CGTAAGCAAA 1320 
AAAAAAAAAA AAAAAACCAA CAAAAATAAA CCCAAACCCC TCAAACAATT TCP AACACG A 1380 
AACATTCTTC TGATGCCGGC ATCCCTGCTT GCAGGTGTGA AGGGGGCAGG AATCAGCGAG 1440 
GTGTCCTGGG CTGAGTCCCC GGGGAAGAAT ATGAT 

Seo ID NO. 2 DN A sequence 
Nndek Add Accession ft X83301.1 

1 11 21 31 41 51 
I I I I I I 

GCAAAGCCAG CTGGGCTCCT GAGTCCGGTG GGTACTTGGA GAACTTACTA OGTCTAGCTG 60 
G AGQAT TGTA AATGCAOCAA TCAGCATGCT GTOTCTAGCT CAAGATTTTC TCCATCCCCT 120 
TATTTTGGGC CAGTGGCTGT CATTACATAT GAGATGAGTC TCTTGAAGAC TACAGATGAA 180 
CTCAAGCTCC ATGAGG AGAT GTTTCATTGT OGAGAGCAGT CATGATGGCC TGCACTCCAC 240 
ACAATGCAAC AGAGTGAAAG AGCAGGTTCT GCTTCTTTGG TGTAGTCCTG AAGCTTCCTA 300 
AGAAACTTCA CATCAGGTGA TGGATAGGAG CAACCCTGTA AAACCAGCCT TAGACTATTT 360 
TTCAAACAGG CTGGTGAATT ACCAGATCTC CGTCAAGTGC AGTAACCAGT TCAAGTTGGA 420 
AGTGTGTCTT TTGAATGCAG AGAACAAAGT CGTGGACAAC CAGGCTGGGA CCCAGGGCCA 480 
GCTGAAGGTG CTGGGTGCCA ACCTCTGGTG GCCGTACCTG ATGCACGAAC ACCCCGCCTA 540 
CCTG TACIC C TGGGAGGATG GTGATTGCTC ACACCAAAGC CTTGGACCCC TCCCAGCCTG 600 
TGACCTTTGG GACCAACTCC ACCTAOGCAG CAGACAAGGG GGCTCTGTAT GTGG ATGTGA 660 
TOCGTGTGAA CAGCTACTAC TCTTGGTATC GCAACTACGG GCAOCTGGAG TTGATTOGGC 720 
TGCAGCTGGC CGCCCAGTTT GAGAATTGGT GTGAGACATC ACAATOOCAT TATTCAGAGC 780 
GCGTATGGAG TGGAAACGCT TGT AGGGTTT CAOCAGGGCT GGTGAATTAC CAGATCTCCG 840 
TCAAGTGCAG TAACCAGTTC AAGTTGGAAG TATGTCTTTT GAATGCAGAA AACAAAGTCG 900 
TGG ACAA(XA GGCTGGGACC CAGGGCCAGC TGAAGGTGCT GGTGCCAACC TCTGGTGGCC 960 
GT ACCTGATG CACGAACACC OCGOCTAOCT GT ACTCGTGG GAGGATGGTG ATTGCTCACA 1020 
CCAAAGCCTT GGACCCCTCC CAGCCTGTGA CCTTTGGGAC CAACTCCACC TAOGCAGCAG 1080 
ACAAGGGGGC TCTGTATGTG GATGTGATCC GTGTGAACAG CTACTACTCT TGGTATCGCA 1140 
ACTACGGGCA OCTGGAGTTG ATTCGGCTGC AGGCCCTGCA GCTGGOCGGC CAGTTTGTGA 1200 
ATTGGTGTAA G AC ATCA CAA TCCCATTATT CAGAGCGCGT ATGGAGTGQA AACGCTTGTA 1260 
GGGTTTCACC AGTCTTTOOC AGGGAACTCC GATGAAGTGT TCCAACAAAA TGAGCGAGTG 1320 
AACCAAGAAG AGG ATGACAT TAGATCCAGG AGATACAACA GAGGAG ATAA TCTCCAGG AT 1380 
GCCTGTGAAG AAAGATCCCT GGATCCCAGG ATGATTATAG GACAAGTTGT TCATAATCCA 1440 
GCAGGOCAGA AGACTTCCAG GGAAACTCAT TTCAAGATGA AAATGGACCA GCCGCAGTGG 1500 
CTCAOGCCTG TAATACXAGC ACTTTGGG AG GCTGAGGCGG GCGGATCACT TGAGGTCAAG 1560 
AGTTTGAAAC TAGCCTGGCC AACGTGGCAA AACTCCATCT CTATTAAAGA TACAAAAATT 1620 
AGCCAGGCAT AGTGGTGCAT GCCTGTAGTC CCAGCTACTT GGGATGCTGA GGCAGGAAGA 1680 
ATTGCTFGAA CCFGGG AGGC AG AGTCTGCG GTGACCGAGA TCATGCCACT GCACTCCAGC 1740 
CTGGGTGACA GAGCCAGACT CCGTCTCTAC TAAAAAAAAA AAAAAAAAAA AAA 

Seg ID NO. 3 Protein sequence: 
Protein Accession #: CAA582S0.1 

1 11 21 31 41 51 
I I I I 1 I 

MDRSNPVKP A LDYFSNRL VN YQISVKCSNQ FKLEVCLLNA ENKWDNQ AG TQGQLKVLGA 60 
NLWWPYLMHE HPAYLYSWED GDCSHQSLGP LPACDLWDOL HLRSRQGGSV CX3CDPCEQLL 120 
LLVSQLRAPG VDSAAAGRPV 

TPNO 4 DNA sequence 
NikIckj Acid Acccssimi ft BC00262Z I 

1 11 21 31 41 51 
1 I I I I I 

GGCACGAGGC TOCGCCOGOG GOCGGGATGC ACTAGGCAAA GCCAGCIXK5G CTOCTGAGTC 60 
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CGGTGGGTAC TTGGAQAACT TACTAOGTCT AGCTGQAGGA TTGTAAATGC ACCAATCAGC 120 
ATGCTGTGTC TAGCTCAAGA TT ITCTO CAT CCCCTTATTT TOQOOCAOTO GCTGTCATTA 180 
CATATGAGAA CTCAAGCTCC ATGAGGAGAT GTTTCATTGT CGAGAGCAGT CATGATGGCC 240 
TGCACTCCAC ACAATGCAAC AGAGTGAAAG AGCAGGTTCT GCTrcnTGG TGTAGTCCTG 300 
AAGCITCCTA AGAAACTTCA CATCAGGTGA TGGATAGGAG CAAOCCTGTA AAACCAGCCT 360 
TAG ACTATTT TTCAAACAGG CTGGTGAATT AGCAGATCTC CGTCAAGTGC AGTAACCAGT 420 
TCAAGTTGGA AGTGTGTCTT T KjAATCCAG AAAACAAAGT CGTGGACAAC CAGGCTGGGA 480 
CCCAGGGCCA GCTGAAGGTG CTGGGTGCCA ACCTCTGGTG GCCGTACCTG ATGCACGAAC 540 
ACOOOGCCTA CCTGTACTCG TGGGAGGATG GTGATTGCTC ACAOCAAAGC CTTGGACCCC 600 
TCCCAGCCTG TGACXTTTGT GACCAACTCC ACCTACGCAG CAGACAAGGG GGCTCTGTAT 660 
GTGGATGTG A TCCGTGTGAA CAGCTACTAC TCTTGGTATC GCAACTACGG GCACCTGGAG 720 
TTGATTCAGC TGCAGCTGGC CGCCCAGTTT GAGAATTGGT GTAAGACATC ACAATCCCAT 780 
TATTCAGAGC GCGTATGG AG TGG AAACGCT TGTAGGGTTT CACCAGTCTT TCCCAGGGAA 840 
CTCCGATGAA GTGTTCCAAC AAAATGAGCG AGTGAACCAA GAAGAGGATG ACATTAGATC 900 
CAGGAGATAC AACAG AGGAG ATAATCTCCA GG ATGCCTGT GAAGAAAGAT CCCTGGATCC 960 
CAGGATGATT ATAGGACAAG TTGTTCATAA TCCAGCAGGC CAGAAGACTT CCAGGGAAAC 1020 
TCATTCAAGG AGGTGAAAAT GATGGATG AC TCCTCCAAGA TGAAAATGGA CCAGCCGCAG 1080 
TGGCTCACGC CTGTAATAOC AGCACTTTGG QAGGCTGAGG CAGGOGGATC ACTTGAGGTC 1140 
AGGAGTTTGA AACTAGCCTG GCCAACGTGG CAAAACTCCA TCTCTATTAA AAATACAAAA 1200 
ATTAGCCAAG CATAGTGGTG CATGCCTGTA GTCCCAGCTA CTTGGGATGC TGAGGCAGGA 1260 
AGAATTGCTT G AACCTGGG A GGCAGAGTCT ACAGTQAGCC GAGATCATGC CACTGCACTC 1320 
CAGCCTGGGC AACACAGTGA G ACTCCATCT CAAAAAAAAA AAAAAAAAAA AA 

Seo ID NO: 5 Protein sequence 
Proteh) Accession if: AAH02622.1 

1 11 21 31 41 51 
I I I I 1 I 

MDRSNPVKPA LDYFSNRLVN YQISVKCSNQ FKXEVCLLNA BNKWDNQAO TQGQLKVLGA 60 
NLWWPYLMHE HPAYLYSWED GDCSHQSLGP LPACDLCDQL HLRSRQGGSV CGCDPCEQLL 120 
LLVSQLRAPG VDSAAAGRPV 

Sea ID NO: 6 PNA scqnencc 

Nucleic Acid Accession U: sec Tabic 25 &25A for complete Kst 



1 11 21 31 41 51 

I I I I J I 

ACCTG AGATC AGGAGTTCGA G ATCAGCCTG ACCAATAGGG TGAAACCCCG TCTCTACTAA 60 
AAATACAAAA AATTAGCTGG ACACGATGGT GGGTGCCTGT GGTCCCGGCT ACTCGGGAGG 120 
CTGAGACAGG AGAATCAGTT G ACCTG GO AG TTGGTGGTTG CAGTGAGCTG AGATCACACC 180 
ATTGCATTCC AAGCCTGGGC AACAAGAGTG AAACTCCATC GCAAAAAAAA AAAAGAAGGG 240 
GCATAATTTG TGG ATOAGGA TTGG ATATAA GGTAAAGGAT GGGACATTCT TGG ACTTACA 300 
G ATGGTGTGA TTGCCTGGCT AGAAGAAGAA TTCCCGGTCA AAAAGAAACC ATCAGCTTTC 360 
CAAGTGTGAA AGAGAGATAA ATCTGTGAAG ATTATAGGGA CTACAGGAAA CTTAATCTTT 420 
TTCTTTGAAA AAGCAATTGT AGCAAAAAAA AAGAAAATTT CTTACTGTCA T CTAAA ATTG 480 
ACATGGACAT CTTAGTGGAC TAGAAGTTAA GGGCATAAAT TCTCCCAGTG ATTTTTAATT 540 
TTAGCATTGT GATTAACACC TTCTAAAATT GCCAGAACTT AATAAATAAT TGCTTTTCAT 600 
TATTAGTATG CCATCAAATT TAGTAGCTGT TTCAGGCTTT AATGTGTCAA GCCTAAAATC 660 
GAGATTTTTQ AGGATCTTCT CCCTCTTAAA AGAGTATTCA GTTAACTGCC GTAGAAATAC 720 
ACATGT ATAC AAGGGCACTG TATACATCAG TCTAAAAAAT AAAAATATGT ATACGTTCTG 780 
GTG AGTCT AG CACAGCATTG CCCAAT AG AA ATACCAATGG AGGTCACAAA TGTGGCOCAT 840 
ATAGGTTAAT TGGT AAATTT TCTNATAGNC ACC 

Seq PNft 7 PN A sequence 

Nucleic Acid Accession*: AK000942 

Coding sequence 1204-1503 



1 II 21 31 41 51 

I I I J I I 

GTAAAGGAAT GTCTTTTTAA TTCAGCI 1 IT CTTTTCTCCA TGCTAGTGTT ATCAGGTTTT 60 
GGTATTTATT TACTTACAGC ATATGTTATG AAGCTGGTTT GAAAATTGGT TTTAGATATA 120 
TCTGCAAGTT TACTACTTTG ACTGTAAAAA AAAAAAATG A AAAAGTAGTT GACATCTGTC 180 
CTCAGAAGAA GTTTGCAGGT TGCATATTTG TGTGTAAATA CACAGGCTAA AAGGTAATTT 240 
ATGTTCCTTG GGAATTGAAA TGGTCAGTGG CCCGTTACAG AAACTTATCA GTCATATATC 300 
AGCACCAGTT CATTCTTTTG CACCTTAGGG ACCATCTGTC CCCTGAGGTG ACCTGAGAAA 360 
CAACCAGTTG CCCACAGACT GTTATTTCTT CAAGTGAGCC AGGATTTO AT TTCACTGCCT 420 
TATATTCTAT TTTTAGTGTA CAGTGCTTTG ATTTTTTGGA AAAACTAAAT TTTAAACATA 480 
TTTGAAAAAT GTTAT AAGAC TTGGACATTA AGTCTGTTGA TAGCCAAAGT CAGTTT ACCA 540 
AAGTAAAACA AATAAATTCT ATGCTTCTTC ATTGTCAAAG AGCAGTCTGC CATCATGTGG 600 
ATATAAATGG ACTATGTAAA GTGACATGGT GCTTACTCTC TACCTAATAA TAGCCTCCCT 660 
CCTGTTCCAA CAAG ATAAOC AACAGGTATA TTTAATTTAC CAGTTAATAT GTTTTGGATA 720 
ATTGGCTGCC TTGAAATGCT ATATGTTTTA TAGTACATCA TAGCTTTAGT TTTCTTCATA 780 
AGG AAATT AC AGTTACATCC TGGCTAACAT GGTG AAACTC CATCTCTACT AAAAATACAA 840 
AAAATTAGCC GGGCGTGGTG GCGGGCACTT GTAGTCCCAG CTACTCGGGA GGCTGAGGCA 900 
GGAGAATGGC GTGAACCCAG G AGGCGGAGG TTGCAGTGAG CCGAGATCGT GCCACTGTAC 960 
TCTGGCCTGG GAGACAGAGC GAGACTCCAT CTCAAAAAAA AAAAAAAAAA AAAAAAAAGA 1020 
GAGAGAGAGA CCTGGAGTAG AGATTCTGTC AAAGAACTTT TTCTTTCTTG AGAAGCATCT 1080 
GAAATGG AAT CTGTTGTCTC TTCGAAATAT GTACIGCTGT AACAGTG AAA CAACCCTCAG 1140 
AGTATGCCTT CGTGTGGGCT ACTCGTTGTG GTTTTGAACT TGGGGGAACT GTCTGTGTTT 1200 
GGGTCAAG AA TATGCAACTG GCTGGGCACA TTGGCTCACG CCTGTAATCC CAGCAATTTG 1260 
GGAGGCTGAG GCAGGCGGAT CACCTGAGGT CAGGGCTTCA AGACCAGACT GGCCAACATG 1320 
GTGAAACCCC GTCTCTACTG AAAATACAAA AATTAGCTGG GCATGGTGGC AGGTGCCTGT 1380 
AATCCCAGCT ACTCGGGAGG CTGACGTGAG AGAATCGCTT GAACCCGGGA GTTGGAGGTT 1440 
GCAGTGAGCC GAGATTGCAC CATTGCACTC CAGCTTGGGC AACAAGAGTG AAACTCTTGT 1500 
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CTCAO 

&qIDNO;8 PNAsecoence 

Nocl^ Acid Accession see Table 25 &25A for complete fci 



I U 21 31 41 51 

L 1 1 1 1 i 

GACT AGGCTG GGCAACATAO TGAGAOCTCA TCTCTAAAAT TAAAAAAATA AAAOOCAGCA 00 
GAAAAAAACC TAAAAACATG CCAAGTGACA TCAGTCTTTG ATGAAAATGG CAGCAGAAGA 120 
GTOATGCCAT GGGTGGGGGT GGGAAATGCT ATTTCAGCAG AGAGGGAGCT GTCATGGAAG 180 
ACACCATQTO GCTGGG CAOG GTGGCTCACA CCTGTAATCC CAGCAC1TTG GGAGATAGAG 240 
GCAGGTGGAT 0CC1 lUAGCT TAGGAATTTG AGACTAGCCT GGGCAATAAG AGTGAAACTC 300 
CATCTCAAAA AAAAAAAAAA AAAAAGGTGC ATGAAACAT A TGAAGCAAAA AGTGAAAGTC 360 
OOCAi mi fT lLLU 1 1 ICC AGAGGTQATT TTTGTGGCCA A1 C1 UG 11 1 C ATTCOC1XXC 420 
AGA CACTT TT CTAGGCATCT ATGCGCCTCT ATTCACATAT AAACAAAATA GGAGTTTTCC 480 
TGTGCTTCCC TTAAATGGCA TATGTATCIT TCACTCTTTT TTTTCACCTA GTGGATCTTT 540 
AATACCTTAA AAGCTCAACC TGGGCTTGGT GCGGTGGCTC ATACGTGTAA TCCCAGGCCT 600 
TTGGGAGGCC A AGGTG GGAG GATCACTTGA GCTCAGGAGT TCCAGACCAT TCCAAAGCAA 660 
AAACAAAAGG ATTTTGAGAT CAGTGTGGGC AACTTAGCAA AACACCATCT CTTAAAAAAA 720 
AAAAAAAAAA 

SeqIDNO:9 DNAscqaence 

Nucleic Add Accession ft BC01 0433.1 

Coding sequ enc e. 3-335 

1 U 21 31 41 51 
I I 1 I I I 

GGTCGCCCTC CGTCGTGGTC TGGCGTGTAT TCCGAGCCTT GGTGTCTGGC GGTTTCCGAG 60 
CGTTGGTGTC TGGCGGTTTC OQ AGCGTTGO TQTCTGGCGG TTTOCQACCG TTGGTGTCTG 120 
GOGGTTTCCO ACCGTTGGTG TCTGGCACGC GCCACCCTCT Cl 1UUT1U G TTGCGCCATG 180 
CCGATGTACC AGACAAG AAG ACAAGAAAAT GATTTGAGGA CAGCTTCAAT CGCGGTGTGA 240 
AGAAGAAAGC AGCAAAACGA CCACTGAAAA CAACGCCGGT GGCAAAATAT CCAAAQAAAO 300 
GGTCCCAAGC GGTACATCGT CATAGCCGGA AACAGTCAGA GCCAOCAGCC AATGATCTTT 360 
TCAATGCTGC G AAAGCTGCC AAAAGTGACA TGCAGCACCG AGAAGTCCGC GTGAAGTGCG 420 
TGAAGGCTCT GAAAGGGCTG TACGGTAACC GGGACCTGAC CGCACGCCTG GAGCTCTTCA 480 
CTGGCCGCTT CAAGGACTGG ATGGTTTCCA TGATCATGGA CAGAGAOTAC AOTGTGGCAG 540 
TGGAGGCOGT CAGATTACTG ATACTTATCC TTAAGAACAT GGAAGGGGTG CTGATGGACG 600 
TGGACTGTGA GAGCGTCTAC CCCATTGTGT AGGCCTCTAA TTGAGGCCTG GCCTCTGCTG 660 
TGGGTGAATT TCTGTACTGG AAACTTTTCT ACCCTGAGTG CGAGATAAGA ACGATGGGTG 720 
GAAP AGA GCA ACGCCAGAGC CCAGGTGCCC AGAGGACTTT CTTCCAGCTT CTGCTGTCCT 780 
TCTTTGTGGA GAGCAAGCTC CACGACCACG CTGCTTACTT AGTAGACAAC CTGTGGGACT 840 
GTGCAGGGAC TCAGCTGAAG GACTGGGAGG GTCTGACAAG CCTGCTGCTG GAGAAGGACC 900 
AGAGCACGTG CCACATGGAG CCAGGGCCAG GGACCTTCCA CCTCCTAGGG TGAAACCAGG 960 
AGAGATTGCT TGCTTCACTT GTACAAGGCA GGAACGGTGG CATGGGGTGG GGGAAACTTG 1020 
GAGTTGGAAG GTGGCTAATC TTTGATTCTA TGI HUG AT CCTCCTGGCA CTCCAGACCT 1080 
GGGTGATGTG CAGGAGAGCA CACTGATAGA AATCCTTGTG TCCAGTGCCC AGCAACTCCT 1140 
GCCTCAGCCT CCCGAGCAGC TGGGACTACA GGCGOCCGCC ACCACGCCTG GCTAACTTTT 1200 
TTGTGTTTTT AGTAGAGACG GGTTTTCACC GTGTTGGCCA GGATGGTCTT GATCTCTTGA 1260 
CCTTGTGATC CACCTGCCTC ATCATCCCAA AGTGCTGGGA TTACAGGCGT G AGCCACTGC 1320 
GCCCAGCATG TTAGACAATT TTTAATTCAT CCICrCTGTG CTGTTGTTTT CTCAGCTGTG 1380 
AAAGGAATAT TCTGGTGGGG ACAAGGTTAC AGAGTTGCTG AGAGGGTCTC ATG ACATGAA 1440 
GGTACTGGOC TTGGCACAGT GCCTGGGGGG GCGGGGACTC CGCACATGCC TGTGATGTCA 1500 
CAGTTACTGT CAGTTCACA G CGAACC TTCC CrCCTTTTCC TGTTGACTTT CCCACACTCC 1560 
TGTAACCCTC CCTCCCTCCC 1 1 Cl 1CCI Cl' CTCTCTCICT CACTCACGCA CACGCACACA 1620 
CACACACACA CACACACACA CACACACTCC ATTCACTGTC TCCATGACTC TGGAGTAAAC 1680 
TAAC GTCTC G AGTTGCCATT GGAAGCCCCG TTGTCCTCAT TTAG ACTTTC ATGGGTTATA 1740 
GGCACTTTTG ACTTCCTGGG GTCCTTCTTC AGTTAAAAAA AAAAATTAGA AAATTAGGCC 1800 
GGGOGTGGTG GCACATGCCT GTAATCCCAG CACCTTGGCC TCCCAAAGTG CTGGGATTAC I860 
AGGAGTGAGC CACCATGCCC AGCCTCCGTT GTCCTCATTT AGACTTTCAT GGGTTATAGG 1920 
CAC1U1GAC TTCCTGGGGT CCTTCTTCAG TTAAAAAAAA AAAAAAAAAA 



Sea ID NO: 10 DNA sequence 

NocJett Acid Accession fc sec Tabic 25 & 25 A for complete list 



1 11 21 31 41 51 
I I I I I I 

AGTGGNTCCC CCGGNCTGCA GGAATTOGGC ACGAGATCAT GATGGCTAAT ATTTCCTGAG 60 
CACCTTTCAT TCAGGCATGA TGCCAGGTGC ACCAACTTAC TTAATCCTCA TAGCCACCAC 120 
CTGAGC AAG CTCCTGTn TA TAAATGGACC A G11C1 1UJT GCTGTTOTAC AAGTTATTTT 180 
CTTTCTATAA CGTCCTCCTT GTCCTCCTTC CACATTCTTA AAGAAACTTT CC C1 1CC1 1 f 240 
AAAGTACTCA GGGAGCCCTG CATTGCTTCT TGAAGCCTTC TCCAGCTTCA TCATCTCACA 300 
GTGGTCTCTC i J i iuACTAA ATGTCCAATA TGCTGCACAT AAGTACCCCA AAGTTAGCAC 360 
AGGAATTGTT CCATGGCTGT CATATATGTT AAAAATCATT AAAAGTTCAT TTTTTCTCTC 420 
ATTATGGGAA GGATACATGC TCCTACTAGT AAATTTAGTA GGTAGAAAAA AATTATCACT 480 
ATCTAGACTG CTTTCCATTT AGTCTTTATG CATAGCTTTC GTGTCTGCCT ATTTTTACCT 540 
TGTGTTTGTA ACTTACTATT ATAAAAT ATG CGTCTCTATG TTCATTGTCA ACGATTATTT 600 
ACAATAACAT GG AGTGG ATT TACATGTATT CTCTATATTT GGATTAAAGG AGATAG AGTA 660 
TGTGAAATTA AATGGG AGAA GTATCTGATA CATAACAGGC AATACAAATA TTATCACATA 720 
GCGTCAATTT A'l 1 1 GTGAAT ATTGAAAGCT CCAAAAAAGA AAAAAA G ' l 1 1 111111A ATT 780 
CCCGT AATTA CTTATTGCAG TATTGTGTTC ATACAAACTG CTCAGTCATT TTGGAG AAAT 840 
aacaattttt TTCCTCATCA TG AAGTAAGG TATGCTCACT GCAAAAAAAA TCTAGAAAAT 900 
AAAGAGGAAC ATGCTAAAGA AAAGAATACT CCCATATAAT CTCTGTCTTC ATAAATAATC 960 
TTTTGTAACG CTTATACACT GCTGGTGGGA ATGTAAATTA GTTCAGCCAT TGTGAAAAGT 1020 
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AGCGTAGCAA TTCCTTGAAA AACTTAAAAT AGATTTACCG TTCAACCCAG CAATCCCATT 1080 
ATTGGGCATA TAOCCAGTGQ AATGTAAATC ATCCTGCCAT AAAAACACAT GCACATGTAT 1140 
GTTCATTGCA GCACTATTCA CAATAGCAAA GACATGG AAT CAACCTATAT GCCCATCAAT 1200 
AGTAG ACTGA ATAAAGAAAA TATGGTACAT ATTCACCACA GAATACTAAG CAGCCATAAA 1260 
AAAAAA 



SeqlDNO; 1 1 DNA sequence 

Nuclei Add Accession* NMJD00582.1 

Coding sequence: 88-990 

1 11 21 31 41 51 
I I I I I I 

GCAGAGCACA GCATCGTCGG GACCAG ACTC GTCTCAGGCC AGTTGCAGCC TTCTGAGOCA €0 
AACGCCG ACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 
GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 
CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 
CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TG ACCACATG 300 
GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 
AACGACTCTG ATGATGTAGA TGACACTG AT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 
TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 
TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 
GGACTG AGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 
G ACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 
CCCGTTGCCC AGGACCTGAA CGCGCCTTCT G ATTGGG ACA GCCGTGGGAA GGACAGTTAT 720 
GAAACG AGTC AGCTGGATGA CCAGAGTGCT G AAACCCACA GCCACAAGCA GTCCAGATTA 780 
TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 
CTTTCCAAAG TCAGCCGTQA ATTCCACAGC CATGAATTTC ACAGCCATG A AGATATGCTG 900 
GTTGTAGACC CCAAAAGTAA GG AAG AAGAT AAACACCTGA AATTTOGTAT TTCTCATGAA 960 
TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 
ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 
CTCAGTTTAT TGGTTOAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1 140 
ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200 
CTATGTTCAT TCTATAG AAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 
TCATG AATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAG AG AAT 1320 
AT AACATTTT ATGTCACTAT AATCTTTTGT 1111 1 AAGTT AGTGTAT ATT TTGTTGTG AT 1380 
TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 
AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCT AAAAAA AAAAAAAAAA AAAA 

SeqlDNO: 12 Protein sequence: 
Protein Accession #: NP0OO573.1 

1 11 21 31 41 51 

I I I I I I 

MRIA VICFCL LGJTCAIPVK QADSGSSHEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPOTL 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 
DFPTDLPATE VFTPVVPTVD TYDGRGDSW YGLRSKSKKF RRPDIQYPDA TDBDTTSHMB 180 
SEELNG AYKA IPVAQDLNAP SDW DSRGK DS YETSQLDDQS AETHSHKQSR LYKRKANDES 240 
NEHSDYIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEE DKHLKFR1SH HLDSASSBVN 



SeqlDNO: 13 PNA sequence 

Nucleic Arid Accession ft NMJD01793 

Coding sequence: 71-2560 

1 11 21 31 41 51 
I I I I I I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 
CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 
CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 
CTTGG AGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 
CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 
TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 
ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 
TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 
AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 
CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 
GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 
CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATG AC CACAAGCCCA AGTTTACCCA 720 
GGACACCTTC CGAGGG AGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 
GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 
CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 
CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 
CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 
GATCCTTGAT GCCAATGACA ATGCTCCCAT GTrTOACCCC CAGAAGTACG AGGCCCATGT 1080 
GCCTG AGAAT GCAGTGGGCC ATGAGGTGCA G AGGCTGACG GTCACTGATC TGG ACGCCCC- 1140 
CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200 
TACCATCACC ACCCACCCTG AG AGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 
TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 
GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGG AGGATG TGAATGAGGC 1380 
ACCTGTGl n GTCCCACCCT CCAAAGTCGT TO AGGTCCAG GAGGGCATCC CCACTGGGGA 1440 
GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 
CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 
TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 
G0TCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 
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ACItJATTGAT GTCAATQACC ATGGOOCAGT COCTGAGOCC CGTCAGATCA CCATCTGCAA 1740 
CCAAAGCC CT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC COCACAOCTC 180D - 
COCTTTCCAO GCCCAGCTCA CAGATGACTC AQACATCTAC TGGACGGCAG AGGTCAACQA I860 
GG AAGGTGAC ACAGTG G ICT TGTOOCTOAA GAAGTTCCTO AAGCAGGATA CATATGACGT 1920 
GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 
GTCCGACTGC CATGGCCATG TCGAAAOCTG OCCTGGACOC TGGAAGGGAG GTTTCATCCT 2040 
CCCTGTGCTQ GGGGCTGTCCTGGCTCTOCT G l lOCllXIIU CTGGTGCrGC 1 I I UiTWi OT 2100 
GAGAAAGAAG CGGAAGATCA AGGAGCCOCT OCTACrCOCA GAAGATGACA CCCGTGACAA 2160 
CGTCT1LTAC TATGGOGAAG AGGGGGGTGG OGAAGAGGAC CAGGACTATG ACATCACCCA 2220 
GCTOCAOOGA GGTCTGGAGG CCAGGOCGGA GOTQOTTC TC OGCAATQACQ TOOCAOCAAC 2280 
CATCATOOOO ACACCCATGT AOCGTOCTOG GCCAGCCAAC CCAGATGAAA TOGGCAACTT 2340 
TATAATTGAG AAOCTGAAGG CGGCTAACAC AGACCOCACA GCCCCGCCCT ACGACAOCCT 2400 
CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGGG TCOCTGAGCT CXXnCACCTC 2460 
CTCCGOCTCC GAOCAAGACC AAGATT ACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 
GAAGCTGGCA GACATGTACG GTGGCGGGGA GGAOGACTAG GCGGOCTGOC TGCAGGGCTG 2580 
CGGACCAAAC GTCAGG CCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 
O ACTT OGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 
AOGTT AGAGT GGTTGCTTOC TTAGOCTTTC AGGATGGAGG AATGTGGGCA GTTTOACTTC 2760 
AGCACTGAAA ACCTCTOCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TOCAGAAGCC 2820 
TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 
TACAGTG GAC 1 1 lClCrUTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATnT 2940 
TTTTTTTAAT GCTA7XTTCA AAACGTTAG A GAAAGTTCTT CAAAAGTGCA GOOCAGAGCT 3000 
GCTG GGCCCA CTGGCCG TCC TGCATTTCTG GTTTOCAGAC COCAATGCCT CCCATTCGGA 3060 
TGGATCTCTG Ob J 1 1 I I ATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 
GTTGCGTTGC TATAGATGAA GGGTG AGGAC AATOGTGTAT ATGTACTAGA ACTTTTTTAT 3180 
TAAAGAAACT TTTCCCAOAA AAAAA 

SeqIDNO: 14 Protein sequence 
Protein Accession ft NP 001784.2 



1 11 21 31 41 51 
I I I I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEFGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQBRRSLKBR NPLKffPSKR ORRHKRDWV VAP1SVPENQ 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKBTGWLLLN KPLDREEIAK 180 
YELFGHAVSE NGASVEDPMN 1SHVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
DEDDAIYTYN GWA YSIHSQ EPKDPHDLMF TIHRSTGTIS VESGLDREK. VFBYTLTfQA 300 
TDMDGDGSTT TAVA WEILD AKDNAPMFDP QKYEAHVPEN AVGHBVQRLT VTTHJ)APNSP 360 
AWRATY1JMG GDDGDHFTTT TBPES«JGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKBYRJLR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR WOYBVMVLA MDNGSPPTTG TGTLLLTLID 540 
VNDHGPVPEP RQITICNQSP VRQVLNTTDK DLSPHTSPFQ AQLTDDSDrr WTAEVNEEGD 600 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WRGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RHKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEWL RNDVAPmP TPMYRPRPAN PDEIGNFIIE NLKAANTKT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 15 DN A sequence 

Nucleic Acid Accession ft XM 05186O2 

Coding sequence: 261-4346 



1 U 21 31 41 51 
I I I I I I 

GAGCTAGOGC TCAAGCAGAG CXXAGCGCGG TGCTATOGGA CAGAGCCTGG CGAGOGCAAG 60 
CGGCGCGGGG AGCCAGCGGG GCTGAGOGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 
AGCTACCACT CCGCTTGCOC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 
ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 
AGAGGG AGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 
TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 
CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GAOCAAGACC 420 
ACCATXjTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CrCTTCTGCC ACGGTCTATT 480 
CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 
TG CGAACCC O GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 
GCCCTTTCCA GGGC AATTT C ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 
CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 
ATGGACAGAA AA AGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 
CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 
TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TO ACACCTAT AGATCCAAGA 900 
AAGAGAOTGA ACGTClGGlt: CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATC C1 TTCTG 960 
TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 
AATTGGGAAG CAAACACTTC CnXjCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 
TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1 140 
CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 
CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 
TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG G AAA^CTCAC (XAGGAAAAA 1320 
TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 
AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 
GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 
CCATTGACAC CAATGTGAAC AGCAC CATTC TGAACTTGGA GOATAATGTA CAGTCATGGA 1560 
AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 
TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 
TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 1740 
TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGOACAA ATGCTACCCC TACAGAAACC 1800 
ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATXAAGTTT GCTCTGGGAT I860 
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TTAAGGCAGC ACACTTG6A0 GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 
AGTAOCCGAT TCACTTOCAC CTGGCCGGTQ ATGTAGAOGA AAGGGGAGGT TATGACCCAC 1980 
CCACATACAT CAGGGAOCTC TOCATOCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 
GCTCCAATGG CTTQTTG ATC AAGGAOGTTG TGGGCTATAA CTCTTTGGGG CACTGCTTCT 2100 
TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTOTCA 2160 
AGTCTGGAAC OCTOCTCCOC TOGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAQAGG 2220 
ACTCCTACXX GGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 
GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT G AGGAAACTG 2340 
GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 
GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 
GGGCTGGGAT GATCATAGAC AACGG AGTGA AAACCACCGA GGOCTCTGOC AAGGACAAGC 2520 
GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA OCAGGAOGCC GACCCGCTO A 2580 
AGOOOCGGGA GCCGGCCATC ATCAG ACACT TCATTGCCT A CAAGAACCAG GAGCAGGGGG 2640 
OCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCOG GTTTGCTGAC AATGGCATTG 2700 
GCCTGACCCT GGCCAGTGGT GGAACCTTCC CGTATGACGA CGGCTCCAAG CAAGAGATAA 2760 
AGAACAGCTT OTnOTTO OC GAGAGTGGCA ACGTGGGG AC GG AAATGATG GACAATAGGA 2820 
TCTGGGGCCC TGGCGGCTTO GACCATAGCG GAAGGAOCCT OCCTATAGGC CAGAATTTTC 2880 
CAATTAGAGG AATTCAGTTA TATGATGGCC CCATCAACAT CCAAAACTGC ACTTTOCGAA 2940 
AGTTTGTGGC CCTGG AGGGC CGGCACACCA GCGCCCTCGC CITOCGCCTG AATAATGCCT 3000 
GGCAGAGCTG CCCCCATAAC AAOGTGACCG GCATTGOCTT TGAGG ACGTT OCGATTACTT 3060 
CCAGAGTGTT CTTOGGAGAG CCTGGGCCCT GGTTCAACCA GCTGG ACATG GATGGGGATA 3120 
AGACATCTGT GTTCCATGAC GTCGACGGCT COGTGTCCGA GTACCCTCGC TOCTACCTCA 3180 
CGAAGAATG A CAACTGGCTG GTCCGGCACC CAGACTGCAT CAATGTTCOC GACTGGAGAG 3240 
GGGCCATTTG CAGTGGGTGC TATGCACAG A TGTACATTCA AGOCTACAAG ACCAGTAACC 3300 
TGCG AATGAA GATCATCAAG AATGACTTOC CCAGCCACCC TCTTTACCTG GAGGGGGCGC 3360 
TCACCAGGAG CACCCATTAC GAGCAATACC AACCGGTTGT CACCCTGCAG AAGGGCTACA 3420 
CCATCCACTG GGACCAGACG GOCCCCGCCG AACTCGCCAT CTGGCTCATC AACTICAACA 3480 
AGGGCGACTG GATCCGAGTG GGGCTCTGCT ACCCGCGAGG CACCACATTC TCCATCCTCT 3540 
CGGATGTTCA CAATCGCCTG CTGAAGCAAA CGTOCAAGAC GGGCGTCTTC GTGAGGACCT 3600 
TGCAG ATGGA CAAAGTGGAG CAG AGCTACC CTGGCAGGAG OCACTACTAC TGGGACGAGG 3660 
ACTCAGGGCT UUUHUJUj AAGCTG AAAG CTCAGAACGA G AGAGAG AAG TTTGCTTTCr 3720 
GCTCCATGAA AGGCTGTGAG AGGATAAAGA TTAAAGCTCT GATTCCAAAG AAOGCAGGCG 3780 
TCAGTGACTG CACAGCCACA GCTTACCCCA AGTTCACCGA GAGGGCTGTC GTAGACGTGC 3840 
CGATGCCCAA G AAGCTCTTT GGTTCTCAGC TGAAAACAAA GG ACCATTTC TTGG AGGTGA 3900 
AGATGGAGAG TTCCAAGCAG CACTTCTTCC ACCTCTGGAA CGACTTOGCT TACATTGAAG 3960 
TGGATGGGAA GAAGTACCCC AGTTCGGAGG ATGGCATCCA GGTGGTGGTG ATTGACGGGA 4020 
ACCAAGGGCG CGTGGTGAGC CACAOGAGCT TCAGGAACTC CATTCTGCAA GGCATAOCAT 4080 
GGCAG CTTI 1 CAACTATGTG GCGAOCATCC CTGACAATTC CATAGTGCTT ATGGCATCAA 4140 . 
AGGGAAGATA CGTCTCCAGA GGCCCATGGA OCAGAGTGCT GGAAAAGCTT GGGGCAGACA 4200 
GGGGTCTCAA GTTGAAAGAG CAAATGGCAT TCGTTGGCTT CAAAGGCAGC TTCCGGCOCA 4260 
TCTGGGTGAC ACTGGACACT GAGGATCACA AAGCCAAAAT CTTOCAAGTT GTGOCCATCC 4320 
CTGTGGTGAA GAAGAAGAAG TTGTGAGGAC AGCTGCCGCC OGGTGOCACC TCGTGGTAGA 43 BO 
CTATGACGGT GACTCTTGGC AGCAG ACCAG TGGGGGATGG CTGGGTCCCC CAGCCCCTGC 4440 
CAGCAGCTGC CTGGGAAGGC CGTGTTTCAG CCCTGATGGG CCAAGGGAAG GCTATCAGAG 4500 
ACCCTGGTGC TGCCACCTGC CCCTACTCAA GTGTCTACCT GGAGOCCCTG GGGCGGTGCT 4560 
GGCCAATGCT GGAAACATTC ACTTTCCTGC AGCCTCTTGG GTGCITCTCT CCTATCTGTG 4620 
CCTCTTCAGT GGGGGTTTGG GGACCATATC AGGAGACCTG GGTTGTGCTG ACAGCAAAGA 4680 
TCCACITTGG CAGGAGCCCT GACCCAGCTA GGAGGTAGTC TGGAGGGCTG GTCATTCACA 4740 
GATCCCCATG GTCTTCAGCA OACAAGTGAG GGTGGTAAAT GTAGGAGAAA GAGOCTTGGC 4800 
CTTAAGGAAA TCTTTACTCC TGTAAGCAAG AGCCAACCTC ACAGGATTAG GAGCTGGGGT 4860 
AGAACTGGCT ATOCTTGGGG AAGAGGCAAG OCCTGCCTCT GGCCGTGTCC ACCTTTCAGO 4920 
AGACTTTGAG TGGCAGGTTT GGACTTGGAC TAGATGACTC TCAAAGGCCC TTTTAGTTCr 4980 
GAGATTCCAG AAATCTGCTG CATTTCACAT GGTAOCTGGA AOCCAACAGT TCATGGATAT 5040 
CCACTGATAT CCATGATGCT GGGTGCCCCA GCGCACACGG GATGGAGAGG TGAGAACTAA 5100 
TGCCTAGCTT GAGGGGTCTG CAGTCCAGTA GGGCAGGCAG TCAGGTCCAT GTGCACTGCA 5160 
ATGCCAGGTG GAGAAATCAC AGAGAGGTAA AATGGAGGCC AGTGCCATTT CAGAGGGGAG 5220 
GCTCAGG AAG GCTTCTTGCT TACAGGAATG AAGGCTGGGG GCATTTTGCT GGGGGGAGAT 5280 
GAGGCAGCCT CTGGAATGGC TCAGGGATTC AGCCCICCCT GCCGCTGCCT GCTG AAGCT G 5340 
GTGACTACGG GGTCGCCCIT TGCTCACGTC TCTCTGGCOC ACTCATGATG GAGAAGTGTG 5400 
GTCAGAGGGG AGCAATGGGC TTTGCTGCTT ATGAGCACAG AGGAATTCAG TCCOCAGGCA 5460 
GCOCTGOCTC TGACTCCAAG AGGGTGAAGT CCACAGAAGT GAGCTOCTGC CTTAGGGCCT 5520 
CATTTGCICr TCATCCAGGG AACTGAGCAC AGGGGGCCTC CAGGAGAOCC TAGATGTGCT 5580 
CGTACTOCCT CGGCCTGGGA TTTCAGAGCT GGAAATATAG AAAATATCTA GCCCAAAGCC 5640 
TTCATTTTAA CAGATGGGGA AAGTGAGCCC CCAAGATGGG AAAGAACCAC ACAGCTAAGG 5700 
GAGGGOCTGG GGAGCCCCAC OCTAGCCCTT GCTGCCACAC CACATTGCCT CAACAACCGG 5760 
COCCAGAGTG CCCAGGCACT CCTGAGGTAG CTTCTGGAAA TGGGGACAAG TCCCCTCGAA 5820 
GGAAAGGAAA TGACTAGAGT AGAATGACAG CTAGCAGATC TCTTCCCTCC TGCTCCCAGC 5880 
GCACACAAAC CCGCOCTCCC CTTGGTGTTG GCGGTCCCTG TGGCCTTCAC TTTGTTCACT 5940 
ACCTGTCAGC CCAGCCTGGG TGCACAGTAG CTGCAACTCC OCATTGGTGC TACCTGGCTC 6000 
TOCrOTCTCT GCAGCTCTAC AGGTGAGGCC CAGCAGAGGG AGTAGGGCTC GCCATGTTTC 6060 
TGGTGAGCCA ATTTGGCTGA TCTTGGGTGT CTGAACAGCT ATTGGGTCCA OCCCAGTOCC 6120 
TTTCAGCTGC TGCTTAATGC CCTGCTCrCT CCCTGGCCCA OCTTATAGAG AGOCCAAAGA 6180 
GCTCCTGTAA GAGGGAGAAC TCTATCTGTG GTTTATAATC TTGCAOGAGG CACCAGAGTC 6240 
TCCCTGGGTC TTGTGATGAA CTACATTTAT CCCCTTTCCT GCCCCAACCA CAAACTCTTT 6300 
CCTTCAAAGA GGGCCTGCCT GGCTCCCTCC ACCCAACTGC ACCCATGAGA CTCGOTCCAA 6360 
GAGTCCATTC CCCAGGTGGG AGCCAACTQT CAGGG AGGTC TTTCCCACCA AACATCTITC 6420 
AGCTGCTGGG AGGTGACCAT AGGGCTCTGC TTTTAAAGAT ATGGCTGCTT CAAAGGCCAG 6480 
AGTCACAGGA AGGACTTCTT CCAGGGAGAT TAGTGG TGAT GGAGAGGAGA GTTAAAATGA 6540 
CCrCATGTCC TTCTTGTOCA OGGTTTTGTT GAGTTTTCAC TCTTCTAATG CAAGGGT CTC 6 600 
ACACTGTGAA CCACTTAGGA TGTGATCACT TTCAGGTGGC CAGGAATGTT QAAT OTCTTT 6660 
GGCTCAGTTC ATTTAAAAAA GATATCTATT TGAAAGTTCT CAGAGTTGTA CATATGTTTC 6720 
ACAGTACAGG ATCTGTACAT AAAAGTTTCT TTCCTAAACC ATTCACCAAG AGCCAAT ATC 6780 
TAGGCATTTT CTTGGTAGCA CAAATTTTCT TATTG CTTAG AAAATTGTCC TCCITGTTAT 6840 
XTCTGTTTGT AAGACTTAAG TGAGTTAGGT CTTTAAGGAA AGCAACGCTC CTCTGAAATG 6900 

crr GTc r rn 1 rcrorre cc gaaatagcto ontx:i i rnc ggg agtt aga tgtatag agt 6960 

GTTTGTATGT AAACATTTCT TGTAGGCATC ACCATGAACA AAG ATATATT TTCTATTTAT 7020 
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TTATTATATG TGCACTTCAA GAAGTCACTG TCAGAGAAAT AAAGAATflGT CTTAAAIGTC? 

Seqn>ND:16 ProtemseqggKe: 
Protein Accession ft XPJ05186O2 

1 11 21 31 41 51 
L 1 1 ' • I 

MGAAQRQDFL F KAMLTTSW L TLTCTPGATS TVAAGCPDQS PELQPWNPQH DQDHHVHIGQ 60 
GKTLLLTSSA TVYSIHISEG GKLVKDHDE PTVLRTR1UL IDNGGELHAG SALCPFQGNF 120 
TULYGRADB GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNJCTLHP GGMAEGGYFF 180 
ERSWGHRGVI VHVTDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 240 
SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHDBYHG HRGSAAARVF 300 
KLFQIEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKEDLWKAH PGKKNRPTD 360 
IQATTMDGVN LSTEWYKKG QJDYRFACYDR GRACRSYRVR FLCQKPVRPK LTVTI0INVN 420 
STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQ VKVA GXFMYLHIGE 480 
HDGVDMRAE V0IX5RNIIVMGEMEDKCYP YRNHICNFFD FDTFGGHKF ALGFKAAHLE 540 
GTELKHMGQQ L VCQYH HFH LAGDVDERGG YDPPTYIRDL SrHBTTSRCV TVHGSNQLII 600 
KDWGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 
PKPRQPCKAV SIFWMANPNN N UNCAA AGS bHl lib WFIFH HVPTGPSVGM YSPGYSEHIP 720 
LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSH SARYSPHQDA DPLKPREPAI 780 
IRHFIAYKNQ DHGAWLRGGP VWLDSCRFAP NGK5LTLASO OTFFYDPGSK QEKKSLPVG 840 
ESGNVGTEMM DNRIWGPGGL DHSGRJLPKj QNFPIRGIQL YDGMNIQKC TFRKFVALEG 900 
RHTSALAFRL NNAWQSCPHN NVTG1AFEDV PTTSRVFFGE PGPWFNQLDM DGDKTSVFHD 960 
VDGSVSEYPG SYLTKNDNWL VRHPDONVP DWRGAICSGC YAQMYIQAYK TSNLRMKDK 1020 
NDFPSH PLYL BGALTRSTHY QQYQPWTLQ KQYTTHWDQT APAELAIWU KFNKGDWRV 1080 
GLCYPRGTTF SILSDVHNRL LEQTSKTQVF VRTLQMDKVE QSYPGRSHYY WDEDSGULFL 1140 
KLKAQNEREK FAFCSMKGCE RDQKALIPK NAG VSDCTAT AYPKFTERAV VDVPMFKKLP 1200 
GSQLKTKDHF LEVKMESSKQ HFPHLWNDPA YffiVDGKKYP SSEDGIQVW 1DGNQGRWS 1260 
HTSFRNSLQ GIPWQLFNYV ATIPDNSTVL MASKGRYVSR GFWTRVLEKL GADRGLKLKE 1320 
QMAFVGFKGS FRPIWVTLDT EDHKAKIFQV VPIPWKKKK L 

SeqIDNO: 17 DNA sequence 

Nucleic Add Accession ft NM015515. 1 

Coding sequence 61-1329 

1 11 21 31 41 51 
I I I I I I 

AGTTCTGOGG TGCCAGGGAG TGGAGCAGAG CTCAGCCOCG TCOCAAACAC AGATGGQACC 60 
ATG AACTOCG GACACAGCTT CAGCCAGACC CCCTCGGCCT CCTTCCATGG CGO0GGAGGT 120 
GGCTGGGGOC GGOXAGGAG CTTOOOCAGO GCTCCCACCQ TCCATGGCGG TGCGGGGGGA 180 
GCCCGCATCT CCCTGTCCTT CACCACGCGG AGCTGCCCAC CCCCTGGAGG GTCTTGGGGT 240 
TCTGG AAGAA GCAGC000CT ACTAGGCGGA AATGGGAAGG OCACCATGCA GAATCTCAAC 300 
GACCGOCTGG OCTGCTAOCT GGAGAAGGTT OGOGOOCTGG AGGAGGCCAA CATGAAGCTG 360 
GAAAGCCGCA TCCTGAAATG GCACCAGCAG AGAGATCCTG GCAGTAAGAA AGATTATTCC 420 
CAGTATGAGG AAAACATCAC ACACCTGCAG GAGCAGATAG TGGATGGTAA GATGACCAAT 480 
GCTCAGATTA TTCTTCTCATTGACAATGCC AGGATGGCAG TGGATGACTT CAACCTCAAG 540 
TATGAAAATG AACACTCCTT TAAGAAAGAC TTGGAAATTG AAGTCGAGGG CCTCCGAAGG 600 
ACCTTAGACA AOCTG ACCAT TGTCACAACA GACCTAG AAC AGGAGGTGGA AGGAATGAGG 660 
AAAGAGCTCA TTCTCATGAA GGAGCACCAT GAGCAGGAAA TGGAGGAGCA TCATGTGCCA 720 
AGTGACTTCA ATGTCAATGT GAAGGTGGAT ACAGGTOCCA GGGAAGATCT GATTAAGGTC 780 
CTGGAGGATA TGAGACAAGA ATATGAGCTT ATAATAAAG A AGAAGCATCG AGACTTGG AC 840 
ACTTGGTATA AAGAACAGTC TGCAGCCATG TCCCAGGAGG CAGCCAGTCC AGCCACTGTG 900 
CAGAGCAGAC AAGGTGACAT CCAOGAACTG AAGOGCACAT TCCAGGCCCT GGAGATTGAC 960 
CTGCAGGCAC AGTACAGCAC GAAATCTGCT TTGGAAAACA TGTTATCCGA GACCCAGTCT 1020 
CGGTACTCCT GCAAGCTCCA GGACATGCAA GAGATCATCT CCCACTATGA GGAGGAACTG 1080 
AOGCAGCTAC GCCACG AACT GGAGCGGCAG AACAATG AAT ACCAAGTGCT GCTGGGCATC 1140 
AAAACCCACC TGGAGAAGGA AATCACCACG TACCGACGGC TCCTGGAGGG AGAGAGTGAA 1200 
GGGACACGGG AAGAATCAAA GTCGAGCATG AAAGTGTCTG CAACTCCAAA GATCAAGGCC 1260 
ATAAOCCAGG AGACCATCAA CGGA AGATTA G11C111U1C AAGTGAATGA AATCCAAAAG 1320 
CAOGCATGAG ACCAATGAAA GTTTOCGCCT GTTGTAAAGT CTATTTTDCC CCAAGGAAAG 1380 
TCC TTGCA CA GACACCAGT O AGTGAGTTCT AAAAGATACC CTTGG AATTA TCAGACTCAG 1440 
AAACTTTTAT TTTTTTTTTT CTGTAACAGT CTCACCAGAC TTCTCATAAT GCTCTTAATA 1500 
TATTGCACTT TTCTAATCAA AGTGCGAGTT TATGAGGGTA AAGCTCTACT TTCCTACTGC 1560 
AGCCTTCAGA TTCTCATCAT TTTGCATCTA TTTTGTAGCC AATAAAACTC CGCACTAGC 



Sea mNa. 18 Protein secmence; 
Protein Accession ft NP_056330.1 

1 11 21 31 41 51 

I I I I I I 

MNSGHSFSOT PSASFHGAGG GWGRPRSFPR APTVHGGAGG ARISLSFTTR SCPFPGGSWG 60 
SGRSSPIXGG NGKATMONLN DRLASYLEKV RALEEANMKL ESRILKWBQQ RDPGSKKDYS 120 
QYEENTTHLQ EQIVDGKMTN AQHLUDNA RMAVDDFNLK. YENEHSFKKD LEIEVEGLRR ISO 
TLDNLTIVTT DLEQSVBGMR KEULMKEHH BQEMEEHHVP SDFNVNVKVP TGPREPLHCV 240 
LEDMRQEYEL HKKKHRDLX) TWYKEQSAAM SQEAASPATV QSRQGDfflEL KRTFQALEID 300 
LQAQYSTKSA LENMLSETQS RYSCKLQDMQ KHSHYEEEL TQLRHELERQ NNEYQVLLGI 360 
KTHLEKEITT YRRIXEGESE GTREESKSSM KVSATPKIKA ITQETINGRL VLCQVNEIQK 420 



SeqIDNO: 19 DNA sequence 

Nucleic Acid Accession ft see Table 25 & 25 A complete Hst 

1 U 21 31 41 51 
I I I I I I 
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1 1 A 1 HI TIT TTAAAAAAAA GAGGCTTGGT AAOnTTTGA TACTT AGTTG ACTTTTAGCA 60 
TTATCCAGCA TTTQTATTAT GAACCAGTGA GTACTGTAAT TTTTCTTTCC CTTTCAGAAA 120 
GACTCAAAGO GAACATATAA ATGTTTCCTA 11 1 1 1NNNNN NNNNNNNNNN NNNNNNNNNN 180 
NNNNAOCCAT CGTGCGATGA TCNNNNNNNN NNNNN>0«INNN NNNNNTTGGG ATCCAGTTTC 240 
AAATAAGGTA TGGGAAAAAC AGATGTTTTC ATTATCGCCA CTTAATCCTT ACTTCCGATT 300 
ATAATTATAC ATGTTTGGCT GTAATAACTA TACTAAAGCA TGCTTGTGAA AGTAGACTTC 360 
TACAAGG ACA GAAAACCCAC AACAACAAAG ATCGATCAOG AAAG ACAAGG CATATTCATT 420 
CATTAATTTA CTICTCTTAG ACCCGGGACA TGTGGGACAA ATACTTTTGT CCTCATGGAT 480 
GGCTTGATAA TTTATTTATA TGTTCTAGAO TCTGAGGATT TTCTTTCAGT GGCAGACAAC 540 
AAAGGATGTT ACAATTTACT TCAAAATAAT ACAATCATGG TTTAATTTAC AGTGTAAATC 600 
CATAACTATT TTATAG AG AT GG ATTATCAT ACATG GG ATT AT AAAAATAA CTT ACCCATA 660 
TGCTTGCAAA ATAGACTTTT CCTATTGGGA GGAACATCTT TTAAOCTAAA ACGGATTTAT 720 
TTCAGATGAA TTAGACAGTA CATTTTTCAG GAGAAOCAGC CTTACTGGAT GAT C1TH GT 780 
CAGGTTTGGA GGCCTCTTCT TTGTCTTTGC AAOCATAACC CXmTTCAGC TGAAGAOCAC 840 
TGGCCTTCAA COCAAGCCAG GAGTTTGGCT CAAATGA 

Seq ID NO? 20 DNA sequence 

Nndeio Add Accession #: D3205M 

Coding sequence: 72-1373 



1 11 21 31 41 51 
I I I I I I 

GAATTCG AAC CAGGTGGCCA CCCGGTGTCG GTTTCATTTT CCTTTGG AAT TTCTGCTTTA 60 
CAGACAG AAC AATGGCAGCC OGAGTACTTA TAATTGGCAG TGG AGG AAGG GAACATAOGC 120 
TGGCCTGGAA ACTTGCACAG TCTCATGATG TCAAACAAGT GTTGGTTGOC OCAGGAAAOG 180 
CAGGCACTGC CTGCTCTGAA AAGATTTCAA ATACOGCCAT CTCAATCAGT GACCACACTG 240 
CCCTTGCTCA ATTCTGCAAA GAGAAGAAAA TTGAATTTGT AGTTGTTGGA CCAGAAGCAC 300 
CTCTGGCTGC TGGGATTGTT GGGAAOCTGA GGTCTGCAGG AGTGCAATGCTTTGGCCCAA 360 
CAGCAGAAGC GGCTCAGTTA GAGTCCAGCA AAAGGTTTGC CAAAGAGTIT ATGGACAGAC 420 
ATGGAATCCC AACCGCACAA TGGAAGGCTT TCACCAAACC TGAAGAAGOC TGCAGCTTCA 480 
TTTTGAGTGC AGACTTCCCT GCTTTGGTTG TGAAGGOCAG TGGTCTTGCA GCTGGAAAAG 540 
GGGTGATTGT TGCAAAGAGC AAAGAAGAGG OCTGCAAAGC TGTACAAGAG ATCATGCAGG 600 
AGAAAGCCTT TGGGGCAGCT GGAGAAACAA TTGTCATTGA AGAACTTCTT GACGGAGAAG 660 
AGGTGTCGTG TCTGTGTTTC ACTGATGGCA AGACTGTGGC CCCCATGCCC CCAGCACAGG 720 
AOCATAAGOG ATTACTGGAG GGAGATGGTG GCOCTAACAC AGGGGGAATG GGAGOCTATT 780 
GTCCAGCCCC TCAGGTTTCT AATGATCTAT TACTAAAAAT TAAAGATACT GT11T1 CAGA 840 
GGACAGTGGA TGGCATGCAG CAAGAGGGTA CTCCATATAC AGGTATTCTC TATGCTGGAA 900 
TAATGCTGAC CAAGAATGGC CCAAAAGTTC TAGAGTTTAA TTGCCXnTTT GGTGATOCAG 960 
AGTGOCAAGT AATCCTCCCA CTTCTTAAAA GTGATCTTTA TGAAGTGATT CAGTCCACCT 1020 
TAGATGGACT GCTCTGCACA TCTCTGCCTG TTTGGCTAGA AAACCACACC GCCCTAACTG 1080 
TTGTCATGGC AAGTAAAGGT TATCCTGGAG ACTACACCAA GGGTGTAGAG ATAACAGGGT 1140 
TTCCTGAGGC TCAAGCTCTA GGACTGGAGG TGTCCCATGC AGGCACTGCC CTCAAAAATG 1200 
GCAAAGTAGT AACTCATGGG GGTAG AGTTC TTGCAGTCAC AGCCATCCGG GAAAATCTCA 1260 
TATCAGCCCT TGAGGAAGCC AAG AAAGGAC TAGCTGCTAT AAAGTTTGAG GGAGCAATTT 1320 
ATAGGAAAGA CATCGGCTTT OGTGOCATAG CTTTOCTOCA GCAGCCCAGG TAAAACTCTA 1380 
AGCAAGTTAG CTGTAGTGOC ATTTCAGAAA CTGGOCTAAA TGGCTATGTA GAACATTOCA 1440 
TTAACCCTAT AAGTCA7TCA GTATTCTnT CTCTCTGTGG GAGTGATACA GTCTTGGTTT 1500 
GTATTT TGTT T O AAT CAAAA CTGGTTATAG CAATACTCAA A TGGAAA AAA CTTC ATOATA 1560 
GCGTAAGTTT GGAAAGTTT A GCAAAATCAC AGTGGTACTG ATTTTTATTT GTTTTCTATT 1620 
IHITIA TTT TAT A1 1 H ' l ' A All U 1 1 I A ACAOGG ICl 1C CTCICI O GOC CAAGTTCTCA 1680 
TGCCTCAGCC TCCCAAATAG CTGGGACTAC AGGCACAGGC CACCACACCT GGCTAATTTT 1740 
TTTQT AT1 1 i J i^ lGGAGAT GGGGTTCAOC ATGTTGCCAA GGCCAGTCTQ AAAGCCTGGG 1800 
CTCAAGTG AT CCTCCTGCTT TGGCCTCCCA AAATGCTGGG ACTATAGGCA TGAGGOGCTG 1860 
CACTTGGCCT GATACTGATT TTTATTCCTr GCGTTATCAC ATAGTGTTGT ATTTGAAACA 1920 
TAGTTCATGG TTTTATCAAA GAACTGAAGA TG AGAATACT GGTCATCTAA CITTGTAATT 1980 
TGATTTGATT ATACTGTAAA GTTTGACAGT CCCATTTTAA CCTGCGTTTG TATCTATTAC 2040 
TAAAATGTAT TTTTTG AOCT CTTACTGATT CATGGTTGGT ATGTACAAAC TGTTG ACTTG 2100 
TAAAATCAAT AAAGTCTTAG TTGG c 

Seq ID NO: 21 Protetq sequence: 
Protein Accession #: BAA06809.1 



I 11 21 31 41 51 
I I I I I I 

MAARVmGS GGREHTLAWK LAQSHHVKQV L VAPGNAGTA CSEEQSNTAI SISDHTALAQ 60 
FCKEKKIEFVWGPEAPLAA G1VGNLRSAG VQCFGPTAEA AQLESSKRFA KEFMDRHGIP 120 
TAQWKAFTKP EEACSFILSA DFPALWKAS GLAAGKG VIV AKSKEEACKA VQEIMQEKAF 180 
GAAGETTVIE ELLDGEEVSC LCFTDGKTVA PMPP AQDHKR LLEGDGGFNT GGMGAYCFAP 240 
QVSNDLLLKI KDTVLQRTVD GMQQEGTPYT GILYAG3MLT KNGPKVLEFN CRFGDPECQV 300 
ILPLLKSDLY EVIQSTLDGL LCTSLPVWLE MHTALTWMA SKGYPGDYTK GVETTGFPHA 360 
QALGLBVSHA GTALKNGKW THGGRVLAVT AIRENLISAL EEAKKGLAAI KFEGA1YRKD 420 
IGFRAIAFLQ QPR 



Seq ID NO: 22 DNA sequence 

Noclefc Acid Accession ft EOS cloned 

Coding seq u ence : 1-2424 



1 U 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAGGCOGTC TGTGTTTTCC TGTnTCCAG AGTGCCCCCA 60 

xctctocctc tocagg aagt ccatgtaagc aaagaaacca tcgggaag AT TTCAGCTGCC 120 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 
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AGCGTOGGG A AAGGGAGCIT TGAAAGGTCC AAGCACITTQ CCATCACAGT CTGTGAOGGT 240 

ctgoacatca goocooagag ggtca oagtg ggagcattoc agttcagttc cactcctcat - 300 
ctggaatto c gcttggattc attttcaacc caacaggaag tgaaggcaag aatcaagagg 360 

ATGG uuuA AAGGAGGGCG CAOGGAGACG GAACTTGCTC TCAAATACCT TCTGCACAGA 420 
GGGTrGOCTG GAGGCAGAAA TGU1 1C1GTU OCOCAGATCC TCATCATCGT CACTOATGGG 480 
AAGI T30CAGO GGGATGTGGC ACTGOCAT0C AAGCAGCTGA AGGAAAGGGO TCTCACICTfl 540 
TTTGCTGrGG GGGTC AGCTTT OOCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGOCT 600 
AGAGGGCAGC AOGTGCTGTT GGCTGAGCAG GTGGAGGATG OCACCAACGG CCTCTTCAGC 660 
ACCCTCAGCA GCTOGGCCATCTGCrOCAGC GCCACGOCAO ACTGCAOGGT OQAGGCICAC 720 
CCCIGTGAGC ACAGGAOGCT GGAGATGGTC CX3GGAGTTOG CTGGCAATGC CCCATGCTCG 780 
AQA GGATOG C GGOGGACCCT TGOGGTGCTG GCTGCACACT OTOOCI AC TA CAOCTOQAAG 840 
AOAGTGTTCC TAACCCACOC TKXACCTGC TACAGGAOCA CCTGCOCAGO CCCCTOTCAC 900 
TOGCAGOOCT GCC AGAAT GG AGGCACATOT GTTOCAGAAG GACTGGACGG CTACCAGTGC 960 
CTtnG<XXXK: TGGCCnT GG AGGGGAGGCT AACTGTGCOCTGAAGCTGAG CCTGGAATGC 1020 
AGGGTCGACC TOCTCT1 OCT GCTGGACAGC TCTGOGGGCA CCACICTGGA OGGCTTCCTO 1080 
CGGGCCAAAG TCTTCGTG AA GCGGTTTGTG CGGGCOGTGC TGAGCGAGGA CTCTCGGGCC 1140 
OGAQTOGGTG TGGCCACATA CAGCAOGGAO CTGCTGGTGG CGGTGOCTGT GGGGOAGTAC 1200 f 
CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCOCACC 1260 
CTGACGGGCA GTGCCITGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGOGC CACCAGGACA 1320 
GGCCAGGACC GGCCACGTAG AGTGGTGGTTTTGCICACTG AGTCACACTC OGAGGATGAG 1380 
GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TOCTGCTGGG TGTAGGCAGT 1440 
GAGGCOGTGC GGG CAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATCGTCTAC 1500 
TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560 
CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GAOCTOGTCT TCATGTTGGA CAOCTCTGCC 1620 
TCAGT AGGGC CCG AGAATTT TGCTCAGATG CAG AGCITTG TGAGAAGCTG TGCCCTCCAG 1680 
TTTG AGGTG A ACCCTGACGT GACACAGGTC GGCCTGGTGG TOTATGGCAG CCAGGTGCAG 1740 
ACTGCCTTCG GGCTGGACAC CAAAGOCAOC CGGGCTGCGA TGCTGOGGGC CATTAGCCAG 1800 
GOOOOCTAOC T AGGTGGGGT GGGCTCAGOC GGCAOOGOOC TGCTGCACAT CTATGACAAA 1860 
GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCOOCA AAGCTGTGGT GGTGCTCACA 1920 
GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 
TCTGTCTTGG TOGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGOGGAG GCTTGCAGGT 2040 
OCCCGGGATT CCCTG ATCCA CGTGGCAGCT TAOGOOGAOC TGOGGTACCA CCAGGACGTG 2100 
CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA AOCTCTGCAA ACCCAGCCCG 2160 
TGCATGAATG AGGGCAGCTG CGTOCTGCAG AATGGGAGCT AOCGCTGCAA GTGTCGGGAT 2220 
GGCTGGGAGG GCCCCCACFG OGAGAACCGT GAGTGGAGCT CTTGCrCTGT ATGTGTGAGC 2280 
CAGGGATCGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC COGTGGAGGA GGGCAGCAGC 2340 
CGTAOCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGOC TACCTTCTGG 2400 
AATGTCTGTG CCCCAGGTCCTTAG 

Scq ID NO: 23 Protein sequence: 
Protein Accession ft HQS cloned 



1 11 21 31 41 51 
I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQHVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 
SVGKGSFERS KHFATTVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARJKR 120 
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKHRGVTV 180 
F AVGV RFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPPCRVEAH 240 
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 
LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RELLLLGVGS 480 
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 
SVGPBNFAQM QSFVRSCALQ PEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRA1SQ 600 
APYLGGVGSA GTAIXHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAOKLRNNGI 660 
SVL WGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV UEWLCGEAK QPVNLCKPSP 720 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 
R3PPSNYREG LGTEMVPTFW NVCAPGP 

Seq ID NO: 24 DNA sequence 

Nodeio Acid Accession #: see Tabic 25 & 25 A for complete list 

I II 21 31 41 51 
I I I I I I 

AGGTCGGCTG GTTATCGGGA GTTGGAGGGC TGAGGTCGGG AGGGTGGTGT GTACAGAGCT 60 
CTAGGACTCA OGCACCAGGC CAGTOGCGGG TTTTGGGCCG AGGCCTGGGT TACAAGCAGC 120 
AAGTGCGCGG TTGGGGOCAC TGCG AGGOCG TTTTAGAAAA CTGTTTAAAA CAAAGAGCAA 180 
TTG ATGG ATA AATCAGGAAT AGATTCTCTT GACCATGTGA CATCTGATGC TGTGGAACTT 240 
GCAAATCG AA GTGATAACTC TTCTGATAGC AGCTTATTTA AAACTCAGTG TATOOCTTAC 300 
TCACCTAAAG GGGAGAAAAG AAACCCCATT CGAAAATTTG TTCGTACACC TGAAAGTGTT 360 
CACGCA AGTA TTCATCAAGT GACTCATCTT TTGAACCAGT ACCATTGACT ATAAAAGCTA 420 
TTTTTGAAAG ATTCAAGAAC AGGAAAAAGA GATATAAAAA AAAG AAAAAG AGGAGGTACC 480 
AGCCAACAGG AAGACCACGG GG AAGACCAG AAGG AAGGAG AAATOCTATA TACTCACTAA 540 
T AGATAAGAA GAAACAATTT AGAAGCAGAG GATCTGGCTT OCCATTTTTA GAATCAGAGA 600 
ATGAAAAAAA CGCACCTTGG AGAAAAATTT TAACGTTTGA GCAAGCTGTT GCAAOAGG AT 660 
TTTT TAACTA TATTGAAAAA CTGAAGTATG AACAOCACCT GAAAG AATCA TTGAAGCAAA 720 
TGAATGTTGG TGAAGATTTA GAAAATGAAG ATTTTGACAG TCGTAGATAC AAATTTTTGG 780 
ATGATGATGG ATCCATTTCT CCTATTGAGG AGTCAAOGCT TTTATCTTGA GGACATGGTG 840 
TCTGGAGTTA AAGGTATTGG CATACTCCAC ACATCTGTAC CATTCTTGAG TGATOGCTTA 900 
GGAATGAATG TGATTTGGAC TCATTCATGT ATGAGAGTAA GCAATGCTTT TTTTTCCAGG 960 
GTGT CAAA TT GAGAACCAGG TAGATCCOCA OCACCTACAG TAAAAAGGAC CCTAAAGTAA 1020 
ATTGGTTGAA GAAATTAGAT OCCAAAGATT CTTGGTGAAT TTTGAAGTCT TCATCAGTAT 1080 
ATC CATAT TA AAAOGAGATG ACAGAAGCCA AAGTAATTAT GGGCTGACAG GACAACTGGA 1140 
TCAGTTICAT TAAAAAGGGC AAACITGAAG ATAAATCTTT TGACTCCAGC TCTTTAGAGG 1200 
ATCTAAAGTG ACCITGATGG ACAGTGGAAG AAATCACAAC ATGG AATTCC TCGAATAACA 1260 
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ATTTATTGAC TTTAAATAAT TTTGTCTAAT GCTACATATA CACAATTAAA AAACCTTTAC 1320 
ACTATTTCTA O AAAOTCAGC ATGTATTTTT GGCTCG AAGT TTCTCTAGTG TTTTCTOTGG 1380 
AAGQAATAAA AATTTG AGGT TTCAATACAA AAACAAAACA AACAACACGA AACACGAAAA 1440 
ACAA TCT Gl 1 OTCOOGOGOC CXTGGGCCCC TTGAGAGAAA A CTTTTT AGA ACOOCTTnO 1500 
CGTTGTGGCO GCCCGGGGGC CCCACAGTTQ GGTTTAGGTG OGCAOCCTTO TGTCTACAAG 1560 
TGGTGTCTCC GCAAGAGAQA GAACACCTCC GGGGTCAAGC GOACAACAAG AGTGCGTCGT 1620 
GAGGACTCTT CAGOCAAAGT ATATAAAAOC OGCOOOGOGG GGGAACCACC GGOCGCITTT 1680 
CTGTAGACAC AACOCCCACA GTGGGAAOCT CTGAGGGCGC ACAGACAGGG CGAGCCTTAT 1740 
CAACAAGGGG TGCCCAACAG AAACCCCGAG TTAAAAATCG 



Scq ID NO, 25 DNA sequence 

Nucleic Arid Accession ft BO001972.I 

Coding sequence: 183-1019 

1 11 21 31 41 51 
I I I I I I 

GGTCGGCTGG TT ATCGGOAG TTGG AGGGCT GAGGTCGGGA GGGTGGTGTG TACAGAGCTC 60 
TAGGACTCAC GCACCAGGCC AGTOGOGGGT TTTGGGGOGA GGCCTGGGTT ACAAGCAGCA 120 
AGTGGGCGGT TGGGGCCACT GCGAGGCCGT TTTAGAAAAC TGTTTAAAAC AAAGAGCAAT 180 
TG ATGGATAA ATCAGGAATA GATTCTCTTG ACCATGTGAC ATCTGATGCT GTGGAACTTG 240 
CAAATCGAAG TGATAACTCT TCTGATAGCA GCTTATTTAA AACTCAGTGT ATCCCTTACT 300 
CACCTAAAGG GGAG AAAAGA AACCCCATTC GAAAATTTGT TCGTACACCT GAAAGTGTTC 360 
ACGCAAGTGA TTCATCAAGT GACTCATCTT TTGAACCAAT ACCATTGACT ATAAAAGCTA 420 
TTTTTG AAAG ATTCAAG AAC AGO AAAAAG A G ATATAAAAA AAAG AAAAAG AGGAGGTACC 480 
AGCCAACAGG AAGACCACGG GGAAGACCAG AAGGAAGGAG AAATCCTATA TACTCACTAA 540 
TAGATAAGAA GAAACAATTT AGAAGCAGAG GATCTGGCTT CCCATTTTTA GAATCAG AGA 600 
ATGAAAAAAA CGCACCTTGG AG AAAAATTT TAAOGTTTGA GCAAGCTGTT GCAAGAGGAT 660 
TTTTTAACTA TATTGAAAAA CTGAAGTATG AACACCACCT GAAAGAATCA TTGAAGCAAA 720 
TGAATGTTGG TGAAGATTTA GAAAATGAAG ATTTTGACAG TCGTAGATAC AAATTTTTGG 780 
ATGATGATGG ATCCATTTCT CCTATTGAGG AGTCAACAGC AGAGGATG AG GATGCAACAC 840 
ATCTTGAAGA TAACGAATGT GATATCAAAT TGGCAGGGGA TAGTTTCATA GTAAGTTCTG 900 
AATTCCCTGT AAGACTGAGT GTATACTTAG AAGAAGAGGA TATTACTGAA GAAGCTGCTT 960 
TGTCTAAAAA GAGAGCTACA AAAGCCAAAA ATACTGGACA GAGAGGCCTG AAAATGTG AC 1020 
AGG ATCATG A ATGTCAAAGG CTTTTATCTT G AG AACATGG TGTCTGGAGT TAAAGG ACTA 1080 
TTGTT AGATC TGTGGGAAGG AATTACAAGA CAGTTGCTAA AAGTTTGAAA AAGACGGTTG 1140 
CT AAACGTT A TG AAAAACCA G AT AATCT AC TTTTTT ACCT TAGGTATTGG CATACTOCAC 1200 
ACATCTGTAC CATTCTTGAG TGATCGCTTA GGAATGAATG TGATTTGAAC TCATTCATGT 1260 
TG AGAGGGTG TCAAATTGAG AACCAGGTAG ATCCCCACCA CCTACAGTAA AAAGGACCCT 1320 
AAAGTAAATT GGTTGAAGAA ATTAGATCCC AAAGATTCTT GGTGAATTTT GAAGTCTTCA 1380 
TCAGTATATC CATATTAAAA CGAOATGACA GAAGCCAAAG TAATTATGGG CTGACAGGAC 1440 
AACTGGATCA GTTTCATTAA AAAGGGCAAA CTTGAAGATA AATCTTTTGA CTCCAGCTCT 1500 
TTAGAGGATC TAAAGTGACC TTGATGGACA GTGGAAGAAA TCACAACATG GAATTCCTCG 1560 
AATAACAATT TATTG ACTTT AAATAATTTT GTCTAATGCT ACATATACAC AATTAAAAAA 1620 
CCTTTACACT AAAAAAAAAA AAAAAA 

Scq ID NO: 26 Protein, sequence: 
Protein Accession #: AAH01972.1 

1 11 21 31 41 51 
11)111 

MDKSGIDSLD HVTSDAVELA NRSDNSSDSS LFKTQCIPYS PKGEKRNPIR KFVRTPESVH 60 
ASDSSSDSSP EHFLTDCAI FERFKNRKKR YKKKKKRRYQ PTGRPRGKPE GRRNPIYSU 120 
DKKKQFRSRG SGTPFLESEN EKN APWRKU , TFEQAVARGF FNYIEKLKYB HH1 JCKSLKQM 180 
NVGEDLENED FDSRRYKFLD DDGStSFIEE STAEDEDATH LEDNECDIKL AGDSFIVSSE 240 
FPVRLSVYLE EEDITEEAAL SKKRATKAKN TGQRGLKM 



Seq ID NO 27 DNA sequence 

Nucleic Arid Accession ft AK027016 

Coding sequence: 207-1043 

1 II 21 31 41 51 
I I I I I I 

CTil 111 ICC GCACGGTTGG AGGAGGTCGG CTGGTTATCG GGAGTTGGAG GGCTGAGGTC 60 
GGGAGGGTGG TGTGTACAGA GCTCTAGGAC TCACGCACCA GGCCAGTCGC GGATTTTGGG 120 
CCGAGGCCTG GGTTACAAGC AGCAAGTGCG CGGTTGGGGC CACTGCGAGG CCGTTTTAGA 180 
AAACTGTTTA AAACAAAGAG CAATTGATGG ATAAATCAGG AATAGATTCT CTTGACCATG 240 
TGACATCTGA TGCTGTGG AA CTTGCAAATC G AAGTGATAA CTCTTCTGAT AGCAGCTTAT 300 
TTAAAACTCA GTGTATCCCT TACTCACCTA AAGGGGAGAA AAGAAACCCC ATTCGAAAAT 360 
TTGTTCGTAC ACCTGAAAGT GTTCACGCAA GTGATTCATC AAGTGACTCA TCTTTTG AAC 420 
CAATACCATT G ACTAT AAAA GCTATTTTTG AAAGATTCAA GAACAGGAAA AAGAGATATA 480 
AAAAAAAGAA AAAGAGGAGG TACCAGCCAA CAGGAAGACC ACGGGGAAGA CCAGAAGGAA 540 
GGAGAAATCC TATATACTCA CTAATAG ATA AG AAGAAACA ATTTAGAAGC AGAGGATCTG 600 
GCTTCCCATT TTTAGAATCA G AGAATGAAA AAAACGCACC TTGGAGAAAA ATTTTAACGT 660 
TTGAGCAAGC TGTTGCAAGA GGATTTTTTA ACTATATTGA AAAGCTGAAG TATG AACACC 720 
ACCTGAAAGA ATCATTGAAG CAAATGAATG TTGGTG AAGA TTTAGAAAAT GAAGATTTTG 780 
ACAGTCGTAG ATACAAATTT TTGGATGATG ATGGATCCAT TTCTCCTATT GAGGAGTCAA 840 
CAGCAGAGG A TGAGGATGCA ACACATCTTG AAGATAACGA ATGTGATATC AAATTGGCAG 900 
GGGATAGTTT CATAGT AAGT TCTOAATTCC CTGTAAGACT G AGTGTATAC TTAG AAGAAG 960 
AGGATATTAC TOAAGAAGCT GCTTTGTCTA AAAAGAGAGC TACAAAAGCC AAAAATACTG 1020 
OACAGAGAGG OCTGAAAATG TGACAGGATC ATGAATGTCA AAGGCTTTTA TCTTG AGAAC 1080 
ATGGTGTCTG GAGTTAAAGG TATTGGCATA CTCCACACAT CTGTACCATT CTTGAGTGAT 1140 
CGCTTAGGAA TGAATGTGAT TTGAACTCAT TCATGTTGAG AGGGTGTCAA ATTGAGAACC 1200 
AGGTAGATCC CCACCACCTA CAGTAAAAAG GACCCTAAAG TAAATTGGTT G AAGAAATTA 1260 
QATCCCAAAG ATTCTTGGTG AATTTTGAAG TCTTCATCAG TATATCCATA TT AAAACGAG 1320 
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ATGACAGAAG OCAAAGTAAT TAT6GCAAGT AATGGTTTTT ATCTTAACTA TAAGTTATTT 1380 
GCICAAGGGT GTAATGGTCA TTACCAAGGC TTTTAOAATG CAGTTTCTCA 1 ITGCTQTGG 1440 
ACATGACCAT AAAAAAAAAT TTOCCAGTAG GIITIUrATC TGCTACGTTG CTAGCAATCA 1500 
GCTTATTGGG A ACAGT TGAT TAACTCTAAT AGAAATGCAA TACAAATAAA ATQTGAAOCA 1560 
CATGTGATTT TTCTTTAAAA TCAGTOAGAT TTOAAAATTC TCCTAGATCT CTTGAATCAT 1620 
GCAAATTTGC TTTSOCTTTA TATTGTAAOC crTO TO Utf I T GCTAATAAOC AAGCAGTTTG 1680 
TAGTAGAGTT AACTCAGGCT COTTCTAGGG ACTCATTCAT GTTCACTCA C TGTACACTCA 1740 
TCICroGAAA TGTAAAATTT ACTTTTATAC TAITUTTATG TAGGGCTGAC AGGACAACTG 1800 
GATCAGTTTC ATTAAAAAGG TATGTATGCA TTAGAAAAGA CATTTGTATG GGTCATTTCA 1860 
AAGAGGGCTT ATGAGCCTGT GAAACCCAGA CCICTTAAOG CTGTOACCAA AGATGGAAOT 1920 
TCTCTATAGG AAGCCATAGC ACTCCTAATG TTTGGTGCTA TGTTTTCCTG AGGAGATATA 1980 
AAACGTAATA ATCCATGATT GTTGCCATOT GAQAGTTTTA AAGGTTAATC AAAATTTCTC 2040 
TTCTTCAGGG CAAACTTGAA GATAAATCTT TTGACTCCAG CTCTTTA GAG GATCTAAAGT 2100 
GAOC1TQATG GACAGTGGAA GAAATCACAA CATGGAATTC CTOGAATAAC AATTTATTGA 2160 
CTTTAAATAA TTTTGTCTAA TGCTACATAT ACACAATTAA AAAACCTTTA CACTATTTCT 2220 
AG AAAGTCAG CATGTATTTT TGGCTCGAAG TTTCTCTAGT GTTTTCTGTG GAAGGAATAA 2280 
AAATTTGAGT TTCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Scq ID NO: 28 Protein sequence 
Protein Accession tk BAB 1 5628.1 

1 H 21 31 41 51 
I I I I I I 

MDKSGK)SLD HVTSDA VELA NRSDNSSDSS LFKTQCIPYS PKGEKRNP1R KFVRTPESVH 60 
ASDSSSDSSF EPIPLTKAI FERFKNRKKR YKKKKKRRYQ PTGRPRGRPE GRRNPIYSLI 120 
DKKKQFRSRG SGFPFLESEN EKNAPWRKIL TFEQA VARGF FNYIEKLKYB HHLKESLKQM 180 
NVGEDLENED FDSRRYKFLD DDGStSPIEE STAEDEDATH LEDNECDIKL AGDSFIVSSB 240 
FFVRLSVYLE EEDITEEAAL SKKRATKAKN TGQRGLKM 



Scq ID NO: 29 DNA sequence 

Nocidc Acid Accession fk KMJ0O4289J 

Coding sequence 493-1695 

1 11 21 31 41 51 
I I I I I I 

GCCGCCGOCT CGTCCACCGG AGGAGCCGGC GCCAGCGTGG ACGGCGGCAG CCAGGCTGTG 60 
CAGGGGGGCG GCGGGGACCC CCGAGCGGCT OGGAGTGGCC CCTTGGACGC CXjGGGAAGAG 120 
GAGAAGGCAC OCGCXjGAAOC GAOGGCTCAG GTGOOGGACG CTGGCGGATG TGCGAGOGAG 180 
GAGAATGGGG TACTAAGAGA AAAGCACXjAA GCTGTGGATC ATAGTTCCCA GCATGAGGAA 240 
AATGAAGAAA GGGTGTCAGC OCAGAAGGAG AACrCACTTC AGCAGAATGA TGATGATGAA 300 
AACAAAATAG CAGAGAAACC TGACTGGGAG GCAGAAAAG A CCACTGAATC TAGAAATGAG 360 
AGACATCTGA ATGGGACAGA TACTTCTTTC TCTCTGGAAG ACTTATTCCA GTTGCTTTCA 420 
TCACAGCCTG AAAATTCACT GGAGGGCATC TCATTGGGAG ATATTCCTCT TCCAGGCAGT 480 
ATCAGTGATG GCATGAATTC TTCAGCACAT TATCATGTAA ACTTCAGCCA GGCTATAAGT 540 
CAGGATGTGA ATCTTCATGA GGCCATCTIG CTTTGTCCCA ACAATACATT TAGAAGAGAT 600 
CCAACAGCAA GGACITCACA GTCACAAGAA CCATITCTGC AGTTAAATTC TCATACCACC 660 
A ATCCTG AGC AAACCCTTCC TGGAACTAAT TTGACAGGAT TTCTTICACC GGTTGACAAT 720 
CATATGAGGA ATCTAACAAG CCAAGACCTA CTGTATGACC TTGACATAAA TATATTTGAT 780 
GAGATAAACT TAATGTCATT GGGCACAG AA GAGAACITTG ATCCAATCGA TGTTTCICAG 840 

1 1 1 iGATG AACCAGATTC TQATTCTGGC Ul 1 1C1 ITAG ATTCAAGTCA CAATAATACC 900 
TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 960 
ACTGACCATG AATCTAGTTC CCATCATGAC TTAGAAGGTG CTGTAGGTGG CTACTAOCCA 1020 
GAACCCAGTA AGCTTTGTCA CTTGGATCAA AGTGATTCTG ATTTCCATGG AGATCTTACA 1080 
TTTCAACACG TATTTCATAA (XACACTTAC CACTTACAGC CAACTGCAOC AGAATCTACT 1140 
TCTGAACCTT TTTXGTGGCC TGGGAAGTCA CAGAAGATAA GGAGTAGATA CCTTGAAGAC 1200 
ACAGATAGAA ACTTGAGCCG TGATGAACAG CGTGCTAAAG CTTTOCATAT CCCTTTTTCT 1260 
GTAGATQAAA TTGTCGGCAT GCCTGTTGAT TCTTTCAATA GCATGTTAAG TAG ATA1TAT 1320 
CTGACAGACC TACAAGTCTC ACTTATOCGT GACATCAGAC GAAGAGGGAA AAATAAAGTT 1380 
GCTGCGCAGA ACTGTCGTAA ACGCAAATTG GACATAATTT TGAATTTAGA AGATGATGTA 1440 
TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCAGAATG TAACAAAGCT 1500 
ATTAACATAA TG AAACAGAA ACTGCATGAC CTTTATCATG ATATTTTTAG TAGATTAAGA 1560 
QAT GACCAA G GTAGGCXAGT CAATCOCAAC CACTATGCTC TCCAGTGTAC CCATGATGGA 1620 
AGTATCTTGA TAGTACCCAA AG AACTGGTG GOCTCAGGCC ACAAAAAGGA AACCCAAAAG 1680 
GGAAAGAGAA AGTGAGAAQA AACTGAAGAT GGACTCTATT ATGTGAAGTA GTAATGTTCA 1740 
GAAACTG ATT ATTTGGATCA GAAACCATTG AAACTGCTTC AAGAATTGTA TCnTAAGTA 1800 
CTGCTACTTG AATAACTCAG TTAACGCTGI 1 1 IGAAGCTT ACATGQACAA ATGTTTAGGA 1860 
CTTCAAGATC ACACTTGTGG GCAATCTGGG GGAGOCACAA CTTTTCATGA AGTGCATTGT 1920 
ATACAAAATT CATAGTTATG TOCAAAG AAT AGGTTAACAT G AAAACCCAG TAAGACTTTC 1980 
CATCTTGGCA GCCATCCTTT TTAAG AGTAA GTTGGTTACT TCAAAAAGAG CAAACACTGG 2040 
ggatcaaatt attttaagag GTATTTCAGT TTTAAATGCA AAATAGCCTT ATTTTCATTT 2100 
AG TTTGTTA O CACTATAGTG AGCTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 2160 
AAATTTTGCT TTCT 



5>q IP NO: 30 Protein sequence; 
Protein Accession ft NPJD04280 

1 U 21 31 41 51 
I I I I I I 

MNSSAHYHVN FSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPFLQ LNSHTTNPEQ 60 
TLPGTNLTGF LSPVDNHMRN LTSQDIXYDL DINJFDEINL MSLATEDNFD PIDVSQLFDE 120 
PDSDSGLSLD SSHNNTSVIK SNSSHSVCDE GAIGYCTDHB SSSHHDLEGA VGGYYPEPSK 180 
LCHLDQSDSD F HGDLT FQHV FHNHTYHLQP TAPESTSEPF PWPGKSQiOR SRYLEDTDRN 240 
LSRDEQRAKA LHHTSVDEI VGMPVDSFNS MLSRYYLTDL QVSURDIRR RGKNKVAAON 300 
CRKRKLDIIL NLEDDVCNLQ AKKETLKREQ AQCNKAINIM KQKLHDLYHD IFSRLRDDQG 360 
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RPVNPNHYAL QCTHDGSIU VPKELVASGH KKETQKGKRK 



ScqlDNCh 31 DNA sequence 

Nude* Acid Accession ft NM_(B326£U 

Cotfing sequence 1-1208 

1 11 21 31 41 51 

ATOAAGTTGG AGGTGTTCGT OOCTOGOGCG GCCCACGGGG ACAAGCAGGG CAGTGACCTG 60 
GAGGGCGCGG GCGGCAGCGA OGCGCCGTCC COGCTGTCGG CGGOGGGAGA CGACTCCCTG 120 
GGCTCAGATG GGGACTGCGC GGCCAAGCCG TCCGCGGGCG GCGGCGCCAG AGATACGCAG 180 
GGCGACGGCG AACAGAGTGC GGGAGGOGGG CCGGGCGCGG AGGAGGCGAT CCCGGCAGCA 240 
GCTGCTGCAG CGGTGGTGGC GGAGGGCGCO GAGGCCGGGO CGGCGGGGCC AGGCGCGGGC 300 
GGOGCGGGGA GCGGCGAGGG TGCACGCAGC AAGCCATATA CGCGGCGGCC CAAGCCCCCC 360 
TACTCGTACA TCGCGCTCAT OGOCATGCCC ATCCGCGACT CGGCG GGCGG GCGCTTGACO 420 
CTGGCGG AG A TCAACGAGTA CCTCATGGGC AAGTTCCCCT TTTTOOGCGG CAGCTACACG 480 
GGCTGGCGCA ACTCCGTGCG CCACAACCTT TCGCTCAACG ACTGCTTCGT CAAGGTGCTG 540 
CGCG ACCCCT CGCGGCCCTG GGGCAAGGAC AACTACTGGA TGCTCAACCC CAACAGCGAO 600 
TACACCITCG CCGACGGGGT CTTCCGCCGC CGCCGCAAGC GCCTCAGCCA CCGCGCGCCG 660 
GTCCCCGCGC CCGGGCTGCG GCCCGAGGAG GCCCCGGGCC TCCCCGCCGC CCCGCCGCCC 720 
GCGCCCGCCG CCCCGGCCTC GCCCOGCATG CX3CTCGCCCG CCCGCCAGGA GGAGCGCGCC 780 
AGCCCCGCGG GCAAGTTCTC CAGCTCCTTC GCCATCGACA GCATCCTGCG CAAGCCCTTC 840 
CGCAGCCGTC GCCTCAG GG A CAOGGCOCCC GGGACGACGC TTCAGTGGGG CGCCGCGCCC^WO 

CCGCTCTGCG CGTACGGCGC GGGCGAGCCG GCGCGGCTGG GCGCGCGCGA GGCCGAGGTG 1020 
CCACCGACCG CGCCGCCCCT CCTGCTTGCA CCTCTCCCGG CGGCGGCCCC CGCCAAGCCA 1080 
CTCCGAGGCC CGGCGGCCGG CXiGCGCGCAC CTGTACTGCC CCCTGCGGCT GCCCGCAGCC 1140 
CTGCAGGCGG CCTTAGTCCG NCX3TCCTGGC CCGCACCTGT CGTACCCGGT GGAGACGCTC 1200 
CTAGCTTGA 

Seq ID NO: 32 Proton sequence: 
Protein Accession ft NP_1 50285.1 

1 11 21 31 41 51 

MKLEVFVPRA AHGDKQGSDL EGAGGSDAPS PLSAAGDDSL GSDGDCAAKP SAGGGARDTQ 60 
GDGEQSAGGG PGAEEAIPAA AAAAWAEGA BAGAAGPGAG GAGSGEGARS KPYTRRPKPP 120 
VSYTAI f AMA mnSAfifiRLT LAE1NEVLMQ KFPFFRGSYT GWRNSVRHNL SLKDCFVKVL 180 
RDPSRPWGKD NYWMLNPNSB YTFADGVFRR RRKRLSHRAP VPAPGLRPEE APGLPAAPPP 240 
APAAPASPRM RSPARQEERA SPAGKFSSSF AIDStLRKPF RSRRLRDTAP GTTLQWGAAP 300 
CPPLPAFPAL LPAAPCRALL PLCAYGAGEP ARLGAPJJAEV PPTAPPLLLA PLPAAAPAKP 360 
LRGPAAGGAH LYCPLRLPAA LQAALVRRPG PHLSYPVETL LA 

ScqtDNO: 33 DNA sequence 

Nucleic Acid Accession ft NM012128J 

Coding sequence: 43-2796 

1 II 21 31 41 51 

GAACAAACCA ACATTTGAGC CAGGAATAAC TAGAGAGGAA CAATGGGGTT ATTCAGAGGT 60 
T1TUUA1C C TCTTAGTTCT GTGCCTGCTG CACCAGTCAA ATACTTOCTT CATTAAGCTG 120 
AATAATAATG GCTTTGAAGA TATTGTCATT GTTATAGATC CTAGTGTGCC AGAAGATGAA 180 
AAAATAATTG AACAAATAGA GGATATGGTG ACTACAGCIT CTACGTACCT GTTTGAAGCC 240 
ACAGAAAAAA GATTTTTTTT CAAAAATGTA TCTATATTAA TTOCTGAGAA TTGGAAGGAA 300 
AATCCTCAGT ACAAAAGGCC AAAACATG AA AACCATAAAC ATGCTGATGT TATAGTIGCA 360 
CCACCTACAC TCOCAGGTAG AGATGAACCA TACAGCAAGC AGTTCACAGA ATGTGGAGAG 420 
AAAGGCQ AAT ACATTCACTT CACCCCTGAC CTTCTACTTG GAAAAAAACA AAA TG AA TAT 480 
GGACCACCAG GCAAACTGTT TGTCCATGAG TGGGCTCACC TCCGGTGGGG AGTGTTTGAT 540 
GAGTACAATG AAGATCAGCC TTTCTACCGT GCTAAGTCAA AAAAAATCGA AGCAACAAGG 600 
TGTTCCGCAQ GTATCTCTGG TAGAAATAGA GTTTATAAGT GTCAAGGAGG CAGCTGTCTT 660 
AGTAGAGCAT GCAGAATTGA TTCTACAACA AAACTGTATG GAAAAGATTG TCAATTCTTT 720 
CCTGATAAAG TACAAACAGA AAAAGCATCC ATAATGTTTA TGCAAAGTAT TGATTCTGTT 780 
GTTGAATTTT GTAACG AAAA AACCCATAAT CAAGAAGCTC CAAGCCTACA AAACATAAAG 840 
TGCAATTTTA GAAGTACATG GGAGGTGATT AGCAATTCTG AGGATTTTAA AAACACCATA 900 
CCCATGGTGA CACCACCTCC TCCACCTGTC TTCTCATTGC TGAAGATCCG TCAAAGAATT 960 
GTGTGCTTAG TTCTTGATAA GTCTGGAAGC ATGGGGGGTA AGGACCGCCT AAATCGAATG 1020 
AATCAAGCAG CAAAACATTT CCTGCTGCAG ACTGTTGAAA ATGGATCCTG GGTGGGGATG 1080 
GTTCACTTTG ATAGTACTGC CACTATTGTA AATAAGCTAA TCCAAATAAA AAGCAGTGAT 1140 
GAAAGAAACA CACTCATGGC AGGATTACCT ACATATCCTC TGGGAGGAAC TTCCATCTGC 1200 
TCTGO AATTA AATATGCATT TCAGGTOATT GGAGAGCTAC ATTCOCAACT CGATGGATCC 1260 
GAAGTACTGC TGCTGACTGA TGGGGAGGAT AACACTGCAA GTTCTTGTAT TGATGAAGTG 1320 
AAACAAAGTG GGGCCATTGT TCATTTTATT GCTTTGGGAA QAGCTGCTGA TGAAGCAGTA 1380 
ATAGAGATGA GCAAGATAAC AGGAGGAAGT CATTTTTATG TTTCAGATGA AGCTCAGAAC 1440 
AATGGCCTCA TTGATGCTTT TGGGGCTCTT ACATCAGGAA ATACTGATCT CTCCCAGAAG 1500 
TCCCTTCAGC TCGAAAGTAA GGGATTAACA CTGAATAGTA ATGCCTGGAT G AACGACACT 1560 
OTCATAATTG ATAGTACAGT GGGAAAGGAC ACGTTCTTTC TCATCACATG GAACAGTCTG 1620 
CCTCCCAGTA TTTCTCTCTG GGATCCCAGT GGAACAATAA TGGAAAATTT CACAGTGGAT 1680 
GCAACTTCCA AAATGGCCTA TCTCAGTATT CCAGGAACTG CAAAGGTGGG CACTTGGGCA 1740 
TACAATCTTC AAGCCAAAGC GAACOCAGAA ACATTAACTA TTACAGTAAC TTCTCGAGCA 1800 
GCAAATTCTT CTGTGCCTCC AATCACAGTG AATGCTAAAA TGAATAAGQA CGTAAACAGT 1860 
TTCCCCAGCC CAATGATTGT TTACGCAGAA ATTCTACAAG GATAT GTAOC TGTI CTTOGA 1920 
GCCAATGTOA CTGCTTTCAT TGAATCACAG AATGGACATA CAGAAGTTTT GGAACTTTTG 1980 
GATAATGGTG CAGGCGCTG A TTCTTTCAAG AATGATGGAG TCTACTCCAG GTATTTTACA 2040 
GCATATACAG AAAATGGCAG ATATAGCTTA AAAGTTCGGG CTCATGGAGG AGCAAACACT 2100 
GCCAGGCTAA AATTACGGCC TOCACTGAAT AGAGCCGCGT ACATACCAGG CTGGGTAGTG 2160 
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AAOGGGGAAA TTGAAGCAAA CCCGCCAAG A CCTGAAATTQ ATGAGGATAC TCAGACCACC 2220 
TTOG AGGATT TCAGCOOAAC ACCATCOGGA GGTGCATTTG TGGTATCACA AGTOCCAAGC 2280 
CTTOCCTTGC CTGAOCAATA OCCAOCAAGT CAAATCACAG ACCTTGATGC CACAGTTCAT 2340 
GAGGATAAGA TTATTCTTAC ATGGACAGCA CCAGGAGATA ATTTTGATGT TGGAAAAGTT 2400 
CAACCI TATA TCATAAGAAT AAGTGCAAQT ATTCTTQATC TAAGAGACAG TTTTGATGAT 2460 
GCTCTTCAAG T AAAT ACT AC TGATCTGTCA OCAAAGGAGG OCAACTOCAA GGAAAGCTTT 2520 
GCATTTAAAC CAQAAAATAT CTCAGAAGAA AATGCAACCC ACATATTTAT TGCCATTAAA 2580 
AGTATAGATA AAAGCAATTT GACATCAAAA GTATCCAACA TTGCACAAGT AACTTTGTTT 2640 
ATCCCTCAAG CAAATGCTGA TGACATTGAT CCTACTCCTA CTCCTACTCC TACTCCTGAT 2700 
AAAAGTCATA ATTCTGGAGT TAATATTTCT ACGCTGGT AT TGTCTGTGAT TGGGTCTGTT 2760 
GTAATTGTTA ACTTTATTTT AAGTACCACC ATTTGAACCT TAACG AAGAA AAAAATCTTC 2820 
AAGTAGACCT AGAAGAGAGT TTTAAAAAAC AAAACAATGT AAGTAAAGGA TATTTCTGAA 2880 
TCTTAAAATT CATCCCATGT GTGATCATAA ACTCATAAAA ATAATTTTAA GATGTCGGAA 2940 
AAGGATACTT TGATTAAATA AAAACACTCA TGGATATGTA AAAACTGTCA AGATTAAAAT 3000 
TTAATAGTTT CATTTATTTG TTATTTTATT TGTAAG AAAT AGTGATGAAC AAAGATCCTT 3060 
TTTCATACTG ATACCTGGTT GTATATTATT TGATGCAACA GTTTTCTGAA ATGATATTTC 3120 
AAATTGCATC AAGAAATTAA AATCATCTAT CTGAGTAGTC AAAATACAAG TAAAGGAGAG 3180 
CAAATAAACA ACATTTGGAA AAAAAAAAAA AAAAAAAA 

Seq fl) Na 34 ProtaaSSSSSE 
Protein Accession ft NP 03626O.I 

1 11 21 31 41 51 
(III! I 

MGLFRGFVFL LVLCLLHQSN TSFIKLNNNG FEDIVTVIDP SVPEDEKHB QIEDMVTT AS 60 
TYLFBATEKR FFFKNVSDJ PENWKBKFQY KRPKHENHKH ADVIVAPPTL PGRDEPYTKQ 120 
FTECGEKGEY IHFTPDLULG KKQNEYGPPG KLFVHEWAHL RWGVFDEYNH DQPFYRAKSK ISO 
KIEATRCSAG ISGRNRVYKC QGGSCLSRAC RIDSTTKLYG KDCQFFPDKV QTEKASIMFM 240 
QSDSVVEFCNEKIH^EAPSU}^ 300 
fORQRIVCLV LDKSGSMGGK DRLNRMNQAA KHFLLQTVEN GSWVGMVHFD STATTVNKLI 360 
QIKSSDERNT LMAGLFTYPL GGTSICSGIK YAFQVIGELH SQLDGSEVLL LTDGEDNTAS 420 
SQDEVKQSG AIVHFIALGR AADEAVIEMS KTTGGSHFYV SDEAQNNGIi DAFGALTSGN 480 
TDLSQKSLQL ESKGLTLNSN AWMNDTVIID STVGK0TFFL ITWNSLPPS1 SLWDPSGTIM 540 
ENFTVDATSK MAYLSIPGTA KVGTWAYNLQ AKANPBTLTI TVTSRAANSS VPPTTVNAKM 600 
NKDVNSFPSP MTVYAHLQG YVPVLG ANVT AFffiSQNGHT EVmXDNGA GADSFKNDGV 660 
YSRYFTAYTE NGRYSLKVRA HGGANTARLK LRPPLNRAAY EPGWWNGEI EANPPRPEID 720 
EDTQTTLEDF SRTASGGAFV VSQVPSLPLP DQYPPSQITD LDATVHEDKI ILTWTAPGDN 780 
FDVGKVQRYI IRISASILOL RDSFPDALQ V NTTDLSPKEA NSKESFAFKP ENISEENATH 840 
IFIAKSIDK SNLTSKYa« AQVTLFIPQA KPDDIDPTPT PTPTPDKSHN SGVNBTLVL 900 
SVIGSWTVN FILSTTI 

SeqlDNO: 35 DNA sequence 

Nucleic Acid Accession NM_000901.1 

Coding sequence: 217-3171 

1 11 21 31 41 51 
I I I I I I 

CGCGGG AGCC AACTTCAGGC TGCTCAGAGG AAGCCCGTGC AGTCAGTCAC CTGGGTGCAA 60 
G AGOGTTGCT GCCTCGGGCT CTOOOGCTGC AGGGAGAGCG GCACTCGCTG GCCTGGATGT 120 
GGTTGGATTT AGGGGGGCTC CGCAGCAGGG GTTTCGTGGC GGTGGCAAGC GCTGCAACAG 180 
GTAG ACGGCG AGAGAOGGAC CCCGGCCGAG GCAGGGATGG AGACCAAAGG CTACCACAGT 240 
CTCCCTGAAG GTCTAGATAT GGAAAGACGG TGGGGTCAAG TTTCTCAGGC TGTGGAGCGT 300 
TCTTCOCTGG GACCT ACAGA G AGGACCGAT GAGAATAACT ACATGGAGAT TGTCAACGTA 360 
AGCTGTGTTT CCGGTGCTAT TCCAAACAAC AGTACTCAAG GAAGCAGCAA AGAAAAACAA 420 
GAACTACTCC CTTGCCTTCA GCAAGACAAT AATCCGCCTG GGATTTTAAC ATCTGATATT 480 
AAAACTG AGC TGG AATCTAA GGAACTTTCA GCAACTGTAG CTGAGTCCAT GGGTTTATAT 540 
ATGGATTCTG TAAGAGATGC TGACTATTCC TATGAGCAGC AGAACCAACA AGGAAGCATG 600 
AGTCCAGCTA AGATTTATCA GAATGTTGAA CAGCTGGTGA AATTTTACAA AGGAAATGGC 660 
CATCGTCCTT CCACTCTAAG TTGTGTGAAC ACGCCCTTGA GATCATTTAT GTCTGACTCT 720 
GGGAGCTCCG TGAATGGTGG CGTCATGCGC GCCATTGTTA AAAGOOCTAT CATGTGTCAT 780 
QAGAAAAGCC CGTCTGTTTG CAGCCCTCTG AACATQACAT CTTCGGTTTG CAGCCCTGCT 840 
GGAATCAACT CIGTGTCCTC CACCACAGCC AGCTTTGGCA GTTTTCCAGT GCACAGCCCA 900 
ATCACCCAGG GAACTCCTCT GACATGCTCC CCTAATGCTG AAAATCGAGG CTCCAGGTCG 960 
CACAGCCCTG CACATGCT AG CAATGTGGGC TCTCCTCTCT CAAGTOCGTT AAGTAGCATQ 1020 
AAATCCTCAA TTTCCAGCCC TCCAAGTCAC TGCAGTGTAA AATCTCCAGT CTCCAGTCCC 1080 
AATAATGTCA CTCTGAGATC CTCTGTGTCT AGCCCTGCAA ATATTAACAA CTCAAGGTGC 1140 
TCTGTTTCCA GCCCTTCG AA CACTAATAAC AG ATCCACGC TTTCCAGTCC GGCAGCCAGT 1200 
ACTGTGGGAT CT ATCTGT AG COCTGT AAAC AATGCCTTCA GCT ACACTGC TTCTGGCACC 1260 
TCTGCTGGAT CCAGT ACATT GCGGGATGTG GTTCCCAGTC CAG ACACGCA GG AGAAAGGT 1320 
GCTCAAGAGG TCOCTTTTOC TAAGACTGAG GAAGTAGAGA GTGCCATCTC AAATGGTGTG 1380 
ACTGGCCAGC TTAATATTGT CCAGTACATA AAACCAGAAC CAGATGGAGC TTTTAGCAGC 1440 
TCATGTCTAG GAGGAAATAG CAAAATAAAT TCGGATTCTT CATTCTCAGT ACCAATAAAG 1500 
CAAG AATCAA CCAAGCATTC ATGTTCAGGC AOCTCTTTTA AAGGGAATCC AACAGTAAAC 1560 
CCGTTTCCAT TTATG QATGG CTCGTATTTT TCCTTTATGG ATGATAAAGA CTATTATTCC 1620 
CTATCAGGAA TTTTAGGACC ACCTGTGCCC GGCTTTGATG GTAACTGTGA AGGCAGCGGA 1680 
TTCCCAGTGG GTATTAAACA AGAACCAGAT GACGGGAGCT ATTACCCAGA GGCCAGCATC 1740 
CCrTCCICTG CTATTGTTGG GGTGAATTCA GGTGGACAGT CCTTCCACTA CAGGATTGGT 1800 
GCTCAAGGTA CAATATCTTT ATCACGATCG GCTAGAGAOC AATCTTTCCA ACACCTGAGT 1860 
TCCnTTCCTC CTGTCAATAC TTTAGTGG AG TCATGGAAAT CACACGGCG A CCTGTCGTCT 1920 
AGAAGAAGTG ATGGGTATCC GGTCTTAGAA TACATTCCAG AA AATG TATC AAGCTCTACT 1980 
TTAOGAAGTG TTTCTACTGG ATCTTCAAGA CCTTCAAAAA TATGTTTGGT GTGTGGCGAT 2040 
GAGGCTTCAG GATGOCATTA TGGGGT AGTC AOCTGTGGCA GCTGCAAAGT TTTCTTCAAA 2100 
AGAGCAGTGG AAGGGCAACA CAACTATTTA TGTGCTGGAA GAAATGATTG CATCATTGAT 2160 
aagattcgac G AAAGAATTG TCCTGCTTGC AGACTTCAGA AATGTCTTCA AGCTGGAATG 2220 
AATTTAGGAG CACGAAAGTC AAAGAAGTTG GGAAAGTTAA AAGGGATTCA CGAGGAGCAG 2280 
CCACAGCAGC AGCAGCCCCC ACCCCCACCC CCACCCCCGC AAAGCCCAGA GG AAGGGACA 2340 
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ACGTACATCG CTCCTGCAAA AG AACCCTCG GTCAACACAQ CACTGGTTCC TCAGCTCTCC 2400 
ACAATCTCAC GAGCGCTCAC ACCTTCOOOC GTTATQGTOC TTGAAAACAT TGAACCTGAA 2460 
ATTGTATATG CAGGCTATGA CAGCTCAAAA CCAGATACAG CCGAAAATCT GCTCTCCACG 2520 
CTCAACCGCT TAGCAGGCAA ACAGATGATC CAAGTCGTGA AGTGGGCAAA GGTACTTCCA 2580 
GGATTTAAAA ACTTGCCTCT TGAGGAOCAA ATTACCCTAA TCCAGTATTC TTGGATGTGT 2640 
CTATCATCAT TTGCCTTGAG CTGGAGATCG TACAAACATA CGAACAGCCA ATTTCTCTAT 2700 
TTTGCACCAG AOCTAGTCTT TAATGAAGAG AAGATGCATC AGTCTGCCAT GTATGAACTA 2760 
TGCCAGGGGA TGCACCAAAT CAGCCTTCAG TTCGTTOGAC TGCAGCTCAC CTTTGAAGAA 2820 
TACACCATCA TGAAAGTTTT GCTGCTACTA AGCACAATTC CAAAGGATGG CCTCAAAAGC 2880 
CAGGCTGCAT TTGAAGAAAT G AGGACAAAT TACATCAAAG AACTGAGGAA GATGGTAACT 2940 
AAGTGTCCCA ACAATTCTGG GCAQAQCTQQ CAGAGGTTCT ACCAACTGAC CAAGCTGCTG 3000 
G ACTCCATGC ATGACCTGGT GAGCGACCTG CTGG AATTCT GCTTCTACAC CTTOCGAGAG 3060 
TCCCATGCGC TGAAGGTAGA GTTCCCOGCA ATGCTGGTGG AG ATCATCAG OGACCAGCTG 3120 
CCCAAGGTGG AGTCGGGGAA CGCCAAGCCG CTCTACTTCC ACCGG AAGTG ACTGCCCGCT 3180 
GGOCAGAAGA ACTTTGCCTT AAGTTTCOCT GTGTTGTTCC ACACCCAGAA GGACCCAAGA 3240 
AAACCTGTTT TTAACATGTG ATGGTTGATT CACACTTGTT CAACAGTTTC TCAAGTTTAA 3300 
AGTCATGTCA GAGGTTTGGA GCCGGGAAAO CT OTTTTTO C GTGGATTTGG CGAGACCAGA 3360 
GCAGTCTGAA GGATTCCCCA CCTCCAATOC CCCAGCGCTT AG AAACATGT TCCTGTTCCT 3420 
CGGGATG AAA AGCCAT ATCT AGTCAATAAC TCTG ATTTTG ATATTTTCAC AGATGGAAGA 3480 
A G T ITT AACT ATGCCGTGTA GTrTCTGGTA TCGTTCGCTT GTTTTAAAAG GGTTCAAGGA 3540 
CTAACGAACG TTTTAAAGCT TACXXTfGGT TTGCACATAA AACGTATAGT CAAT ATGGGG 3600 
CATTAAT ATT CTTTTGTTAT T AAAAAAACA CAAAAAAAT A AT AAAAAAAT AT ATACAG AT 3660 
TCCTGTTGTG TAATAACAGA ACTCGTGGCG TGGGGCAGCA GCTGCCTCTG AGCCCTCGCT 3720 
CGTOCACGGT CTTCTGCATC ACTGGTATAC ACACTCGTTA GCGTCCATTT CTTATTTAAT 3780 
TAGAATGGAT AAGATG ATGT TAAATGOCTT GGTTTGATTT CTAGTATCTA TTGTGTTGGC 3840 
TTTACAAATA Al I ' l ' lT I U CA G T C l'l'l' lti CT GTGCTGTAGA TTACTGTATG TATAAATTAT 3900 
GAAGG ACCTG AAATAAGGTA TAAGGATCTT TTGTAAATGA GACACATACA AAAAAAATCT 3960 
TTAATGGTTA ATAGG ATGAA TGGGAAAGTA TTTTTGAAAG AATTCTATTT TGCTGGAGAC 4020 
TATTTAAGT A CTATCTTTGT CTAAACAAGG TAATTTTTTT TTGTAAAGTG CAATGTCCTG 4080 
CATGCATAAT G AACCGTTTA CAGTGTATTT AAG AAAGGGA AAGCTGTGCC TTTTTTAGCT 4140 
TCATATCTAA TTTACCATTA TTTTACAGTC TC TOTTOTAA ATAACCACA C TGAAACCTCT 4200 
TCGGTTGTCT TGAAACXTTT CTACrTTTTC TGTACTTTTT GTTTTGTTCT TGGTCTCCCG 4260 
CTTGGGGCAT TTGTGGG ACT CCAGCACGTT TTCTGGCTTC TGCTTCATCC TGCTCCATCG 4320 
GGGAATG ACA CACTGCGGTG TCTGCAGCTC CTGGAAGGTG TCATTTG ACA ACACATGTGG 4380 
GAGAGGAGGT CCTTGGAGTG CTGCAGCTTT GGGAAAGCCT GCCTCGTTTC CCTTTTCCTC 4440 
TAG AAGCAGA AOCAGCTCTA CGAGAGTGAG ACTGGGAACT TGATGGCTCA GAGAGCATCT 4500 
TTTCCTCCCA TTTTAGAAAA TCAG ATTTTC TCCTGTGGGA AAAAAAAATT CCATGCACTC 4560 
TCTCTCTGTT AAAGATCAGC TATTCCCTTC TGATCTTGGA AAGAGGTTCT GCACTCCTGG 4620 
AACOGGTCAC AGGAACGCAC AGATCATGGC AGGATGCGCT GGGACGGCCC ATCTTGGCAA 4680 
GGTTCAGTCT GAATGGCATG GAGACCGGG A GATAGAGGGG TTTTAGATTT TTAAAAGGTA 4740 
GGTTTTAAAA ATAAGTTTTA TACATAAACA GTTTTGGAGA AAAATTACAG ATCATATAAG 4800 
CAAGACAGTG GCACTAAAAT GTTTAATTCA TTAATCTGTT TGTTTGGCAC TGATGCAATG 4860 
TATGGCnTT CTCTTGCCCC AAATCACAAA CATATGTATC TTTGGGGAAA CTAACAATAT 4920 
GATTGCACTA AATAAACTAC TTTGAATAGA GGCCAAATTA ATCTTTTAAA AATGATGATA 4980 
ATCATCAGGT TTACTCAGTG AAATCATATT AATTATTTTC CAAAATCTAA AAGCTGTAGC 5040 
TGGAGAAGCC CATGGCCACG AGGAAGCAGC AATTAATTAG ATCAACACTT TTCTCCAGGG 5100 
TTCACCATGC AGGCAACATT ACCTTGTCTT TCAAAAGACA CCTGCCTTAG TGCAAGGGGA 5160 
AACCTGTGAA AGCTGCACTC AGAGGGAGGA GTCTTTCTTA CATAATTTGC AATTTCAGGA 5220 
ATTTAATTTA TAGGCAGATC TTTAAATACA GTCAACTTAC GGTGCACAGT AATATGAAAG 5280 
CCACACTTTG AAG GTAATAA ATACACAGCA TGCAGACTGG GAGTTGCTAG CAAACAAATG 5340 
GCTTACTTAC AAAAGCAGCT TTTAGTTCAG ACTTAGTTTT TATAAAATG A GAATTCTGAC 5400 
TTACTTAACC AGGTTTGGGA TGGAGATGGT CTGCATCAGC TTTTTGTATT AACAAAGTTA 5460 
CTGGCTCTTT GTGTGTCTCC AGGTAACTTT GCTTGATTAA ACAGCAAAGC CATATTCTAA 5520 
ATTCACTGTT G AATGCCTGT CCCAGTCCAA ATTGTCTGTC TGCTCTTATT TTTGTACCAT 5580 
ATTGCTCTTA AAAATCTTGG TTTGGTACAG TTCATAATTC ACCAAAAAGT TCATATAATT 5640 
TAAAG AAACA CTAAATTAGT TTAAAATGAA GCAATTTATA TCTTTATGCA AAAACATATG 5700 
TCTGTCTTTG CAAAGG ACTG TAAGCAGATT ACAATAAATC CTTTACTTT 



Sw| ID NO: 36 Protem sequence: 
Protein Accession ik NPJD00892.1 

1 11 21 31 41 51 

I I I I I I 

METKGYHSLP EGLDMERRWG QVSQA VERSS LGPTESTDBN NVMETVNVSC VSGAIPNNST 60 
QGSSKEKQEL LPCLQQDNNR PG1LTSDIKT ELESKELSAT V AESMGLYMD SVRDADYSYE 120 
QQNQQGSMSP AK TYQNV BQL VKFYKGNGHR PSTLSCVNTP LRSFMSDSGS SVNGGVMRAI 180 
VKSPIMCHEK SPSVCSPLNM TSSVCSPAGI NSVSSTTASF GSFPVHSPIT QGTPLTCSPN 240 
AENRGSRSHS PAHASNVGSP LSSPLSSMKS SISSPPSHCS VKSPVSSPNN VTLRSSVSSP 300 
ANINNSRCSV SSPSNTNNRS TLSSPAASTV QgCSPVNNA FSYTASGTSA GSCTLRDWP 360 
SPDTQEKG AQ EVPFPKTEEV ESAISNGVTG QLNTVQYIKP EPDG AFSSSC LGGNSK3NSD 420 
SSFSVPKQE STKHSCSGTS FKGNPTVNPF PFMDGSYFSF MDDKDYYSLS GILGPPVPGF 480 
DGNCEGSGFP VGDCQEPDDG SYYPEASIPS SAJVGVNSGG QSFHYRIGAQ GTJSLSRSAR 540 
DQSFQHLSSF PPVNTLVBSW KSHGDLSSRR SDGYPVLEYI PENVSSSTLR SVSTGSSRPS 600 
KKXVCGDEA SGCHYGWTC GSCKVFFKRA VEGQHNYLCA GRNDCUDKI RKKNCPACRL 660 
QKCLQ AGMNL GARKSKKLGK LKGIHEEQPQ QQQPPPPPPP PQSPEEGTTY IAPAKEPSVN 720 
T ALVPQLSTI SRALTPSPVM VLENIEPETV Y AOYDSSKPD T AENLLSTLN RLAGKQMIQV 780 " 
VKWAKVLPGF IQ^LPLEDQIT UQYSWMCLS SFALSWRSYK HTNSQFLYFA PDLVFNEEKM 840 
HQSAMYELCQ GMHQISLQFV RLQLTFEEYT IMKVLLLLST 1PKDGLKSQA AFEEMRJNYI 900 
KELRKMVTKC PNNSGQSWQR FYQLTKLLDS MHDLVSDLLE FCFYTFRESH ALKVEFPAML 960 
VEHSDQLPK VESGKAKPLY FHRK 

Sm ID NO; 37 DNA seqppic<» 

Nucleic Acid Accession ft see Table 25 &25A for complete list 
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CCTAACAACT A^aSa^A^A^A^^^^^,^™ 01 «° 
GCAATAAATA CAG ATC<K;AC TACA^ AA^ rrT^Sv^SS^ 5i ^^^^^'^ TTTAATAATA $40 

Seqfl>NQ:38DNAa 



NntoAddAcctssioDft NM 001192.1 

Ccxfing sequence 219-773 



GGACTGGTGA TCAAATT^VTT ^^^^GMI^^^t^k^^^T^^^^ 3 Q AAAAGAGCA 540 
<™AAOACTOCATCA^£^ 

CTAT(KSA<X3AA£H5C^^ *» 

TTTOGACT0O AGCACnCK^S CTTIAAA^ TAATTAAGCA 780 

ACTCnTATQ TTAGATATAT TTCl^rAGRT tjS*t£™tuE£ JJ^^OCTCT AACTOTGGAA 900 
CTIXKrTTTCA TCATTAAAGT SttS^^?otS^^^^ AGCTT AATGG TAG AAACTTC 960 



ScqIDNO:39| 

ftotetn Accession ft NPOM 183.1 

1 H 21 31 41 51 



ISAR —» •IR.INUVUKSLPAALSATHHKS 180 

Scq ID NO: 40 DNA i 



^^^^m^2 NMQ55087.I 



1 11 ( 21 31 41 51 

CGATGACCTC GCTGTGGA^GAA^C^S^^^^P^^HEf^^ 180 
CTCTCTAOCA TCACCTG^C^^A^A^^^S^I? 1 ' OTTTCITC< ^ T 240 



^^XX^GCTGC^^^^^^^^^^^ TACCTGCACA 1000 
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CCATGTGGCT AGGGG AAAAG TTGGGTTTCT ATACAGACTT TGGTCCAA6C ACAAGGTATC 1680 
ACACTTGGGG GATTATGGCT TTGTCAAGAT ACOCAATTOT GAAATCTGAG GATCACCTTC 1740 
TTCCGTCACC AOAGGGCQAG ATCGCACCAO CCATCACATT GACOGTTAAC ATTTCGGGCA 1800 
AGCTGGTGGA TTTTGTCGTG ACACACTTTG GG AAOCACGA AGATGACCTC GACAGGAAAC I860 
TGCAGGCTAT TGCTGTTTCA AAACTACTGA AAAGTAGCTC TAATCAAGTG ATATTTCTGG 1920 
GATATATCAC TTCAGCACCT GGCTCCAGAG ATTATCTACA GCTCACTGAA CATGGCAATG 1980 
TGAAGGATAT CGACAGCACT QATCATGACA GATGGTGTGA ATACATTATG TATCGAGGGC 2040 
TGATCAGGTT GGGTTATGCA AGAATCTCCC ATGCTGAACT GAGTGATTCA GAAATTCAGA 2100 
TGGCAAAATT TAGGATCCCT GATGAOCCCA CTAATTATAG AGACAACCAG AAAGTGGTCA 2160 
TAQACCACAG AGAAGTTTCT GAGAAAATTC ATTTTAATOC CAGATITGGA TCCTACAAAG 2220 
AAGGACACAA TTATGAAAAC AACCATAATT TTCATATGAA TACTCCCAAA TACTTTTTAT 2280 
GAAACATTTA AAACAAG AAG TTATTGGCTG GGAAAATCTA AGAAAAAAAG TATGTAAGAT 2340 
AAAAAGAAGA G ATTAATGAA AGTGGGAAAA TACACATGAA GAACCTCAAC TTAAAAAACA 2400 
CATGGTATCT ATGCAGTGGG AAATTACCTC CATTTGTAAA CTATGTTGCT TAATAAAAAC 2460 
ATTTCTCTAA AAAAAAAAAA AAAAAA 

Scq ID NO: 41 Protefn sequence: 
Proton Accessioa #: NP JQ79363.I 



1 11 21 31 41 51 
I I I I I I 

MTSLWREILL ESLLGCVSWS LYHDLGPMIY YFPLQTLELT GLEGFSIAFL SHFLTTTPF 60 
WKLVNKKWML TLLRHTIGS IASFQAPNAK LRLMVLALGV SSSLTVQAVT WWSGSHLQRY 120 
LRIWGF1LGQ IVLWLRIWY TSLNPIWSYQ MSNKVILTLS AIATLDRIGT DGDCSKPEEK 180 
KTGEVATGMA SRPNWLLAGA AFGSLVFLTH WWGEVSLVS RWAVSGHPHP GFDFNPFGGA 240 
VLLCLASGLM LPSCLWFRGT GLIWWVTGTA SAAGLLYLHT WAAAVSGCVF AIFTASMWPQ 300 
TLGHLINSGT NPGKTMTIAM IFYT.T.RTFFC AWCTAFKFVP GGVYARERSD VLLGTMMLU 360 
GLNMLFGPKK NLDLLLOTKN SSKVLFRKSB KYMKLFLWLL VGVGLLGLGL RHKAYERKLG 420 
KVAPTKEVSA AIWPFRFGYD NEGWSSLERS AHLLNETG AD FTTILESDAS KPYMGNNDLT 480 
MWLGEKLGFV TDFGPSTRYH TWGIMALSRY HVKSBHHLL PSPEGHIAPA ITLTVNISGK 540 
LVDFWTHFG NHEDDLDRKL QAIAVSKLLK SSSNQVXFLG YtTSAPGSRD YLQLTEHGNV 600 
KDfDSTDHDR WCBY1MYRGL IRLGYARISH AELSDSEIQM AKFRIPDDPT NYRDNQK WI 660 
DHREVSEKIH FNPRFGSYKE GHNYENNHNF HMNTPKYFL 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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W^BCLAI^ 

1 1 . A method of detecting a metastatic colorectal cancer-associated 

2 transcript in a cell from a patient, the method comprising contacting a biological sample from 

3 the patient with a polynucleotide that selectively hybridizes to a sequence at least 80% 

4 identical to a sequence as shown in Tables 1-26. 

1 2. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3 . The method of claim 1 , wherein the polynucleotide is labeled 

1 4. The method of claim 1, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 5. An isolated nucleic acid molecule consisting of a polynucleotide - 

2 sequence as shown in Tables 1-26. 

1 6. An expression vector comprising the nucleic acid of claims. 

1 7. A host cell comprising the expression vector of claim 6. 

1 8. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-26. 

1 9. An antibody that specifically binds a polypeptide of claim 8. 

1 10. The antibody of claim 10, which is an antibody fragment 

1 11. The antibody of claim 10, which is a humanized antibody ^ 

1 12. A method of detecting a metastatic colorectal cancer cell in a 

2 biological sample from a patient, the method comprising contacting the biological sample 

3 with an antibody of claim 9. 

1 13. The method of claim 12, wherein the antibody is labeled 

1 14. A method of detecting antibodies specific to metastatic colorectal 

2 cancer in a patient, the method comprising contacting a biological sample from the patient 

3 with a polypeptide encoded by a nucleic acid comprises a sequence from Tables 1-26. 
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^ IS. A method for identifying a compound that modulates a metastatic 
colorectal cancer-associated polypeptide, the method comprising die steps o£ 

(i) contacting the compound with a metastatic colorectal cancer-associated 
polypeptide* the polypeptide encoded by a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-26.; and 

(ii) detennining the functional effect of die compound upon the polypeptide. 

16. The method of claim 15, wherein the functional effect is determined by 
measuring ligand binding to the polypeptide. 

17. A method of inhibiting proliferation of a metastatic colorectal cancer- 
associated cell to treat colorectal cancer in a patient, the method comprising the step of 
administering to the subject a therapeutically effective amount of a compound that modulates 
a polypeptide encoded by a sequence as shown in Tables 1-26. 

18. A drug screening assay comprising the steps of 

(i) administering a test compound to a mammal having colorectal cancer or a 
cell isolated therefrom; 

(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-26. in a 
treated cell or mammal with the level of gene expression of the polynucleotide in a control 
cell or mammal, wherein a test compound that modulates the level of expression of the 
polynucleotide is a candidate for the treatment of colorectal cancer. 

19. A pharmaceutical composition for treating a mammal having colorectal 
cancer, the composition comprising a compound identified by the assay of claim 18 and a 
physiologically acceptable excipient 

20. A method of detecting a metastatic colorectal cancer-associated 
polypeptide in a cell from a patient, the method comprising contacting a biological sample 
from the patient with a antibody that that specifically binds a polypeptide encoded by a 
nucleic acid molecule having polynucleotide sequence as shown in Tables 1-26. 

21. The method of claim 21, wherein the antibody is labeled. 
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